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INTRODUCTION 


Autism  spectrum  disorders  (ASD)  differ  from  other  developmental  disorders  in  that  children  with 
ASD  show  little  interest  in  other  people.  This  lack  of  interest  is  associated  with  other  complex 
social  deficits,  including  empathy  and  shared  attention,  thus  further  disrupting  the  capacity  to 
engage  in  normal  social  interactions.  Research  findings  suggest  that  social  problems  in  ASD 
derive,  in  part,  from  dysfunction  in  the  neural  circuits  that  motivate  the  other-regarding  behaviors 
that  shape  normal  social  interactions.  Other-regarding  preferences  (ORPs)  describe  a  concern 
for  the  welfare  or  the  benefit  of  others.  ORPs  may  rely  on  empathy,  a  social-cognitive  capacity 
severely  compromised  in  ASD,  and  pathological  deficits  of  empathy  in  ASD  may  result  from  a 
failure  to  understand  others’  internal  states.  Accumulating  evidence  implicates  orbitofrontal 
cortex  (OFC)  dysfunction  in  the  pathophysiology  of  ASD.  Furthermore,  oxytocin  (OT),  a 
neuromodulatory  hormone  implicated  in  social  behavior  in  mammals,  has  also  been  implicated 
in  the  etiology  of  ASD.  Understanding  the  neuronal  properties  of  OFC  neurons  and 
demonstrating  OT-induced  changes  in  ORP  in  a  rhesus  macaque  model  will  significantly 
advance  our  understanding  of  social  processing  in  both  healthy  and  ASD  brains.  Our  research 
aims  to  develop  a  non-human  primate  model  for  ORP  specifically  designed  to  probe  these 
mechanisms  in  healthy  individuals,  the  neuronal  mechanisms  involved  in  the  expression  of 
ORPs,  and  the  efficacy  of  pharmacological  OT  therapies  designed  to  enhance  social  interaction 
in  ASD. 
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BODY 


Objectives  specified  in  the  approved  Project  Narratives 

•  Objective  1 :  Develop  an  animal  model  of  ORPs. 

•  Objective  2:  Determine  how  OFC  neurons  mediate  ORPs. 

•  Objective  3:  Determine  the  effects  of  OFC  perturbations  on  ORPs. 

•  Objective  4:  Determine  whether  OT  can  enhance  positive  ORP. 

By  tasks  indicated  in  Statements  of  Work  (SOW) 

Task  1.  Characterize  neural  responses  in  the  orbitofrontal  cortex  (OFC) 

We  have  completed  this  task.  Prior  to  the  beginning  of  investigating  neuronal  correlates 
of  ORPs  in  the  primate  OFC,  we  extensively  characterized  the  social  behaviors  of  rhesus 
macaques  in  the  ORP  task  (see  the  behavioral  setup  and  tasks  in  Appendix  1).  We  showed  that 
rhesus  macaques  (Macaca  mulatta)  spontaneously  derive  vicarious  reinforcement  from 
observing  rewards  given  to  another  monkey,  and  this  motivates  them  to  subsequently  deliver  or 
withhold  rewards  from  the  other  monkey  depending  on  their  social  relationship  and  other 
aspects  of  the  context.  We  exploited  both  Pavlovian  and  instrumental  conditioning  procedures 
to  associate  rewards  to  self  (donor  monkey)  and/or  rewards  to  another  monkey  (recipient 
monkey)  with  visual  cues.  Donor  monkeys  made  more  errors  (i.e.,  incomplete  trials)  in  the 
instrumental  trials  when  cues  predicted  reward  to  recipients  compared  to  when  cues  predicted 
reward  to  themselves,  but  made  even  more  errors  when  cues  predicted  reward  to  no  one.  In 
subsequent  preference  tests  between  pairs  of  previously  conditioned  cues,  the  donor  monkeys 
preferred  cues  paired  with  reward  to  the  recipients  over  cues  paired  with  reward  to  no  one.  By 
contrast,  the  donor  monkeys  preferred  cues  paired  with  reward  to  self  over  cues  paired  with 
reward  to  both  monkeys  delivered  at  the  same  time.  Rates  of  looking  at  recipients  strongly 
predicted  the  strength  and  valence  of  vicarious  reinforcement,  suggesting  a  role  for  attention  in 
gating  social  preferences.  Furthermore,  the  preference  to  donate  was  enhanced  by  greater 
familiarity  between  the  two  animals,  and  is  abolished  if  the  recipient  was  replaced  with  a  juice 
collection  bottle,  demonstrating  the  fundamentally  social  nature  of  the  task.  These  patterns  of 
behavior  are  consistent  with  vicarious  reinforcement  derived  from  observing  another  individual 
receive  a  reward.  Vicarious  reinforcement  in  rhesus  macaques  may  play  a  critical  role  in 
shaping  cooperation  and  competition,  as  well  as  motivating  observational  learning  and  group 
coordination,  much  as  it  does  in  humans  and  perhaps  other  highly  social  species.  We  propose 
that  vicarious  reinforcement  signals  mediate  these  behaviors  via  neural  circuits  involved  in 
reinforcement  learning  and  decision-making  that  are  shared  by  rhesus  macaques  and  humans. 

We  next  recorded  the  activity  of  85  single  OFC  neurons  in  two  donor  monkeys 
performing  the  ORP  task.  Figure  la  illustrates  the  OFC  region  of  the  brain  targeted  with 
electrodes  on  a  representative  coronal  slice  from  structural  magnetic  resonance  scans.  Figure 
1b  shows  a  typical  OFC  neuron  that  preferentially  encoded  juice  rewards  received  by  the  donor 
monkey  (i.e.,  self  rewards).  On  choice  trials,  this  neuron  responded  more  strongly  for  self 
rewards  than  for  the  alternatives  on  both  Self: Neither  (i.e.,  choosing  between  rewards  to  self 
and  neither)  and  SelfiOther  (i.e.,  choosing  between  rewards  to  self  and  other)  trials.  Activity 
associated  with  self  rewards  did  not  differ  between  SelfiNeither  and  SelfiOther  (7.00  ±  0.47, 
7.03  ±  0.46  sp/s,  respectively,  P  -  0.97,  Welch  two  sample  t- test),  but  it  significantly  exceeded 
the  cell’s  activity  for  other  and  neither  rewards  on  OtheriNeither  trials  (i.e.,  choosing  between 
delivering  rewards  to  the  recipient  [other  rewards]  and  delivering  rewards  to  neither  [neither 
rewards])  (3.06  ±  0.40,  1.85  ±  0.42  sp/s,  respectively;  both  P  <  0.0001).  On  cued  trials,  this 
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neuron  fired  most  to  self  rewards  compared  to  both  other  and  neither  rewards  (both  P  <  0.0001 , 
Welch  two  sample  t- test),  but  did  not  differentiate  between  other  and  neither  rewards  (P  =  0.25) 

(Fig.  1b). 

During  reward  delivery,  the  OFC  population  predominantly  encoded  self  rewards 
compared  to  other  or  neither  rewards.  The  response  for  self  rewards  was  30%  greater 
compared  to  other  rewards  (0.30  ±  0.09,  P  <  0.005,  paired  f-test),  and  the  bias  for  self  rewards 
over  neither  rewards  was  greater  by  17%  (0.17  ±  0.08,  P  <  0.05,  paired  f-test)  (Fig.  1c,  2a). 
Population  activity  for  other  and  neither  rewards  did  not  differ  in  OFC  (0.08  ±  0.06,  P  =  0.20, 
paired  f-test)  (Fig.  1c,  2a).  On  cued  trials,  the  self  reward  bias  over  other  rewards  was  absent 
(0.19  ±  0.16,  P  =  0.24,  paired  f-test),  and  the  self  reward  bias  was  only  weakly  present  over 
neither  rewards  (0.26  ±  0.15,  P  <  0.08).  On  cued  trials,  the  population  did  not  distinguish  other 
from  neither  rewards  (P  =  0.33,  paired  f-test)  (Fig.  1c).  Figure  3  shows  population  responses 
aligned  to  the  times  of  decision-making  and  cue  offset.  These  results  indicate  that  OFC  neurons 
predominantly  compute  rewards  received  by  donor  monkeys,  and  this  information  is  encoded 
more  faithfully  when  monkeys  actively  choose  whom  to  reward.  In  the  population,  reward  epoch 
responses  differed  significantly  for  a  large  number  of  neurons  depending  on  reward  outcome 
(57%),  trial  type  (45%)  and  reward  volume  (24%)  (analysis  of  variance,  ANOVA,  Methods; 
Table  1).  Based  on  the  statistical  significance  (ANOVA)  during  choice  and  reward  epochs,  we 
classified  individual  neurons  as  self-referenced,  other-referenced,  both-referenced,  or 
unclassified.  When  considering  the  proportion  of  different  cell  types  among  the  classified 
neurons  based  on  this  scheme,  78%  of  OFC  neurons  were  self-referenced,  whereas  only  10% 
were  other-referenced  and  12%  were  both-referenced  (both  P  <  0.0001,  x2  test)  (Fig.  2b). 

As  a  complementary  measure  of  neuronal  selectivity,  we  also  examined  whether  lower 
trial-to-trial  variability  was  associated  with  preferred  reward  outcomes  in  these  neurons.  We 
tested  whether  the  coefficient  of  variation  in  firing  rate  (CV;  Methods)  was  systematically  lower 
for  self  reward  outcomes  compared  to  other  and  neither  reward  outcomes.  We  indeed  found  this 
to  be  the  case.  The  OFC  population  showed  a  lower  CV  for  self  rewards  compared  to  others 
(received  -  omitted ,  -0.12  ±  0.04  [mean  ±  s.e.m.],  P  <  0.01,  one-sample  f-test)  (Fig.  4).  Thus, 
the  most  robust  responses  of  neurons  were  also  the  most  reliable. 

In  the  ORP  task,  donors  were  allowed  to  shift  their  gaze  at  the  recipient.  To  rule  out  the 
possibility  that  preferential  reward  responses  we  observed  were  simply  driven  by  where  the 
donors  looked  on  a  given  trial,  we  compared  reward  epoch  responses  on  trials  with  and  without 
gaze  shifts  to  the  recipient.  We  found  no  systematic  differences  in  the  reward  responses  of  OFC 
neurons  at  the  population  level  (P  >  0.20,  Wilcoxon  signed  rank  test)  (Fig.  5).  Thus,  the  reward- 
related  responses  in  OFC  cannot  be  simply  explained  by  preparation  to  look  at  the  recipient  or 
elicited  as  a  consequence  of  inspecting  the  recipient. 

To  test  whether  different  neuronal  frames  of  reference  (self-,  other-,  and  both- 
referenced)  were  anatomically  segregated  within  OFC,  we  applied  principal  component  analysis 
on  recording  coordinates  to  identify  and  test  the  distributions  in  the  major  axis  with  the  largest 
dispersion  within  three-dimensional  space.  We  did  not  observe  any  systematic  anatomical 
clustering  amongst  different  frames  of  reference  in  OFC,  indicating  that  self-,  other-,  and  both- 
referenced  neurons  within  OFC  were  intermingled  (all  P  >  0.56,  Wilcoxon  rank  sum  test).  A 
paper  based  in  part  upon  this  work  is  in  press  at  Nature  Neuroscience. 
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General  and  behavioral  procedures 


All  procedures  were  approved  by  the  Duke  University  Institutional  Animal  Care  and  Use 
Committee,  and  were  conducted  in  compliance  with  the  Public  Health  Service’s  Guide  for  the 
Care  and  Use  of  Laboratory  Animals. 

Two  donor  (MY  and  MO)  and  five  recipient  monkeys  ( Macaca  mulatta)  participated  in 
the  experiments.  Donors  (both  males)  and  recipients  (four  males,  one  female)  were  unrelated 
and  not  cagemates.  Donors  were  housed  in  a  colony  with  12  other  rhesus  macaques,  some  of 
which  were  pair-housed.  All  the  male  monkeys  reside  in  this  colony,  and  the  one  female 
monkey  resided  in  the  adjacent  colony  with  only  females.  For  all  monkeys,  a  sterile  surgery  was 
performed  prior  to  experiments  to  implant  a  head-restraint  prosthesis  (Crist  Instruments)  using 
standard  techniques.  Donor  monkeys  were  trained  on  the  ORP  task  in  the  presence  of  a 
recipient.  Subsequently,  a  second  surgery  was  performed  on  donors  to  implant  a  recording 
chamber  (Crist)  providing  access  to  OFC.  All  surgeries  were  performed  under  isoflourane 
anesthesia  (1-3%),  and  the  recording  chambers  were  regularly  cleaned,  treated  with  antibiotics 
and  sealed  with  sterile  caps. 

Horizontal  and  vertical  eye  positions  were  sampled  at  1000  Hz  using  an  infrared  eye 
monitor  camera  system  (SR  Research  Eyelink).  Visual  stimuli  were  controlled  by  PsychToolBox 
and  Matlab  (Mathworks).  Donors  and  recipient  monkeys  both  sat  in  primate  chairs  (Crist),  100 
cm  from  one  another  at  a  45-degree  angle.  Each  monkey  had  his  own  computer  monitor  which 
displayed  identical  visual  stimuli.  Both  the  donor  and  recipient  monkeys  had  their  own  juice  tube 
from  which  juice  drops  were  delivered.  In  order  to  prevent  monkeys  from  forming  secondary 
associations  of  solenoid  valve  clicks  or  the  sound  of  the  recipient  drinking  the  juice  reward,  the 
solenoid  valves  were  placed  in  another  room  and  white  noise  was  also  played  in  the 
background.  For  example,  experimenters  were  unable  to  hear  solenoids  anywhere  inside  the 
recording  room.  Critically,  a  separate  solenoid  designated  for  neither  rewards  was  also  placed 
outside  the  room;  it  only  produced  clicks  but  delivered  no  fluid.  The  face  region  of  the  recipient, 
with  respect  to  the  gaze  angle  of  the  donor  (horizontal  and  vertical  eye  positions),  was  mapped 
out  empirically  prior  to  the  experiments.  The  frequency  with  which  donors  looked  at  recipients 
was  computed  from  counting  the  number  of  gaze  shifts  to  the  recipient’s  face  (±  8.5°  from  the 
center  of  the  face)  (Chang  et  al. ,  2012a).  A  large  window  was  used  to  capture  gaze  shifts  that 
were  brief  in  duration  and  large  in  magnitude  and  often  directed  at  varying  depths  (e.g.,  eyes, 
mouth). 


Monkeys  performed  the  task  to  obtain  drops  of  cherry-  or  orange-flavored  juice.  Donors 
began  a  trial  by  shifting  gaze  (±  2.5°)  to  a  central  stimulus  (0.5°  x  0.5°),  and  maintained  fixation 
for  200  ms.  On  the  majority  of  sessions  (see  above),  the  reward  magnitude  at  stake  (0.1  - 
2.4ml)  on  each  trial  was  cued  by  the  position  of  a  horizontal  bisecting  line  (200ms),  indicating 
the  percentage  of  the  maximum  possible  volume.  There  were  two  kinds  of  trials,  choice  trials 
and  cued  trials.  Following  a  variable  delay  of  300,  500,  or  700  ms,  choice  and  cued  trials  were 
presented  at  equal  probabilities  and  randomly  interleaved.  On  choice  trials,  two  visual  targets 
(4°  x  4°)  appeared  at  two  random  locations  7°  eccentric  in  the  opposite  hemifield.  Donors  shifted 
gaze  to  one  target  (±  2.5°)  to  indicate  a  decision  within  the  maximum  allowed  time  of  1.5  s 
relative  to  the  stimulus  onset.  The  pair  of  stimuli  appearing  on  a  given  trial  was  drawn  from  the 
set  of  three  stimuli,  which  were  pseudorandomly  selected.  On  cued  trials,  donors  maintained 
fixation  (±  2.5°)  while  a  cue  (4°  x  4°)  appeared  centrally  to  the  screen  for  500  ms.  Cues 
indicating  rewards  for  the  donor,  recipient  or  neither  monkey  occurred  with  equal  frequency  and 
pseudorandomly  determined.  Reward  onset  was  followed  by  a  variable  0  -  900  ms  delay,  from 
the  time  of  either  making  a  choice  (choice  trials)  or  cue  offset  (cued  trials).  Donors  were  free  to 
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look  around  the  room  during  this  delay  and  for  another  one  second  after  reward  delivery. 
Reward  delivery  was  followed  by  an  intertrial  interval  of  700,  1,000,  or  1,300  ms.  Upon  making 
an  error,  both  monkeys  received  visual  feedback  (a  white  rectangle,  10°  x  10°)  followed  by  a  5s 
time  out  before  the  next  trial  began. 

Recording  procedures 

All  recordings  were  made  using  extracellular  tungsten  electrodes  (FHC).  Single 
electrodes  were  lowered  using  a  hydraulic  microdrive  system  (Kopf  Instruments,  or  FHC). 
Single-unit  waveforms  were  first  isolated,  and  action  potentials  collected,  using  a  16-channel 
recording  system  (Plexon,  Inc.).  In  order  to  guide  the  placement  of  recording  tracks  and  localize 
recording  sites,  we  acquired  structural  magnetic  resonance  images  (MRI)  (3T,  1  mm  slices)  of 
each  donor’s  brain  prior  to  the  experiments  (e.g.,  see  Fig.  la).  Detailed  localizations  were  made 
using  Osirix-viewer.  We  also  confirmed  that  electrodes  were  in  OFC  by  listening  to  grey-matter 
and  white-matter  associated  sounds  while  descending  the  electrodes  down  to  OFC.  OFC 
neurons  were  recorded  from  Brodmann  areas  13m  and  11  (Fig.  la).  A  total  of  85  OFC  neurons 
(MY:  46,  MO:  39)  were  included.  These  neurons  were  selected  for  recording  based  solely  on 
the  quality  of  isolation  upon  encountering.  For  a  small  subset  of  the  data  (27%),  data  were 
collected  in  a  task  with  a  fixed  reward  size  (typically  1.0ml  per  successful  trial).  For  the  majority 
of  the  cells  (42%),  data  were  either  collected  in  a  task  with  the  magnitude  cue,  or  both  with  and 
without  the  magnitude  cue  (i.e.,  two  or  more  consecutive  blocks  per  cell)  (31%).  We  combined 
the  two  types  of  data  in  our  analyses  unless  otherwise  specified. 

Data  analysis 

Data  from  each  cell  consisted  of  firing  rates  during  440  ±  13  (±217)  (median  ±  s.e.m. 
(±s.d.))  trials.  The  monkeys  performed  the  task  well,  as  evidenced  by  a  high  percentage  of 
completed  trials  even  on  trials  in  which  they  did  not  receive  juice  reinforcement  (e.g.,  also  see 
Chang  et  al. ,  2012a). 

Choice  preference  indices  were  constructed  as  contrast  ratios  (Eq.  1)  (Chang  et  al., 
2012a). 


Preference  Index  =  — - — — .  (1) 

P  a  +  P  B 

Ra  and  Rb  values  were  the  frequency  of  making  particular  choices.  For  Self  .Other  trials,  RA  and 
Rb  were  number  of  choices  to  reward  other  and  self ,  respectively.  For  OtheriNeither  trials,  RA 
and  Rb  were  number  of  choices  to  reward  other  and  neither ,  respectively.  Finally,  for 
SelfiNeither  trials,  RA  and  RB  were  number  of  choices  to  reward  neither  and  self ,  respectively. 
Indices  therefore  ranged  from  -1  to  1,  with  1  corresponding  to  always  choosing  to  donate  on 
OtheriNeither  trials  and  Self  .Other  trials,  and  always  choosing  not  to  reward  self  on  Self:  Neither 
trials.  An  index  of  -1  corresponds  to  the  opposite,  generally  stated  as  choosing  not  to  donate  to 
the  other  monkey  or  choosing  to  reward  oneself.  Values  of  0  indicated  indifference.  For 
constructing  neuronal  preferences,  we  simply  substituted  the  choice  frequency  with  neuronal 
firing  rates  associated  with  making  specific  decisions.  Response  times,  the  time  from  the  onset 
of  choices  to  movement  onset,  were  computed  using  a  207sec  velocity  threshold  criterion 
(Chang  et  al.,  2012a) . 

Firing  rates  were  computed  during  the  reward  epoch  (from  50  to  600ms  from  reward 
onset)  as  well  as  for  the  choice  epoch  (from  -100  to  400ms  from  making  a  choice).  For  the 
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population  analyses,  we  normalized  reward  epoch  firing  rates  to  the  average  baseline  rates  for 
each  reward  outcome  (300  ms  interval  prior  to  fixation  onset).  Using  marginally  different  time 
windows  and  different  normalization  methods  all  resulted  in  similar  conclusions.  Coefficients  of 
variation  (CV)  were  calculated  for  each  neuron  based  on  the  standard  deviation  (a)  and  mean 
(p)  using  the  spike  rates  (sp/s)  from  the  reward  epoch  (Eq.  2): 

CV=-  (2) 

/x 

We  used  analysis  of  variance  (ANOVA)  to  classify  the  reward  response  selectivity  of 
individual  neurons  per  individual  cells.  Two-factor  ANOVA  was  used  to  classify  the  selectivity  of 
reward  outcome  (self,  other,  or  neither)  and  trial  type  (choice  or  cued).  Three-factor  ANOVA 
was  used  to  classify  the  selectivity  of  reward  volume  (binned  into  small,  medium,  large)  for  the 
cells  that  were  collected  in  the  task  with  a  magnitude  cue.  Statistical  significance  for  each 
reward  type  was  computed  by  Tukey  HSD  test.  Across  all  analyses,  using  slightly  different 
epoch  durations  for  neuronal  data  analyses  led  to  similar  results  and  conclusions. 

Classification  of  cell  types  by  significant  reward  specificity 

Based  on  Tukey  HSD  tests,  we  classified  cells  into  the  following  categories:  self- 
referenced,  other-referenced,  both-referenced  (mirror),  and  unclassified.  These  categories  do 
not  imply  functional  roles  but  indicate  that  firing  rates  were  significantly  different  based  on 
reward  outcomes.  A  neuron  was  referred  as  self-referenced  if  the  responses  of  the  neuron  were 
significantly  different  (P  <  0.05)  between  self  and  other  rewards  as  well  as  between  self  and 
neither  rewards,  but  not  different  between  other  and  neither  rewards.  A  neuron  was  referred  as 
other-referenced  if  the  responses  of  the  neuron  showed  significant  differences  in  firing  rates 
between  self  and  other  rewards  as  well  as  between  other  and  neither  rewards,  but  not  different 
between  self  and  neither  rewards.  Finally,  a  neuron  was  referred  as  both-referenced  or  mirror  if 
the  responses  of  the  neuron  showed  significant  differences  in  responses  between  self  and 
neither  rewards  as  well  as  other  and  neither  rewards,  but  not  different  between  self  and  other 
rewards.  Neurons  not  belonging  to  one  of  these  categories  were  considered  as  unclassified. 
Applying  slightly  different  criteria  or  differently  configured  ANOVA  variables  did  not  change  the 
overall  results. 

Task  2.  Characterize  behavior  after  muscimol  inactivation 

We  plan  to  reversibly  inactivate  OFC  neurons  with  muscimol  next  year  and  examine  the 
effects  on  performance  on  the  ORP  task  and  social  attention. 


Task  3.  Determine  behavioral  response  to  microstimulation 

We  plan  to  microstimulate  OFC  neurons  this  year  and  examine  the  effects  on 
performance  on  the  ORP  task  and  social  attention. 


Task  4.  Examine  the  effect  of  oxytocin  (OT)  on  task  performance 

We  have  completed  this  task.  We  showed  that  inhaled  OT  (using  pediatric  nebulizer) 
penetrates  the  central  nervous  system  and  subsequently  enhances  the  sensitivity  of  rhesus 
macaques  to  rewards  occurring  to  others  as  well  as  themselves  in  the  ORP  task.  Roughly  2 
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hours  after  inhaling  OT,  donor  monkeys  increased  the  frequency  of  prosocial  choices 
associated  with  reward  to  another  monkey  (i.e.,  recipient  monkey)  when  the  alternative  was  to 
reward  no  one.  OT  also  increased  attention  to  the  recipient  monkey  (i.e.,  number  of  gazes 
directed  at  the  recipient  following  decisions)  as  well  as  the  time  it  took  to  make  such  a  decision 
(i.e.,  eye  movement  choice  reaction  times).  In  contrast,  within  the  first  2  hours  following 
inhalation,  OT  enhanced  selfish  choices  associated  with  delivery  of  reward  to  self  over  a  reward 
to  the  other  monkey,  without  influencing  attention  or  decision  reaction  times.  Despite  the 
differences  in  species  typical  social  behavior  between  rhesus  macaques  and  more  egalitarian 
and  monogamous  species  (like  prairie  voles  and  humans),  exogenous,  inhaled  OT  causally 
promotes  social  donation  behavior  in  rhesus  monkeys  when  there  is  no  perceived  cost  to  self. 
These  findings  potentially  implicate  shared  neural  mechanisms  and  validate  the  use  of  inhaled 
OT  as  a  potential  therapeutic  for  enhancing  social  attention  and  prosocial  behavior  in  ASD.  The 
detailed  methods,  results,  and  figures  can  be  found  in  the  Appendix  1  and  were 
published  in  Proceedings  of  the  National  Academy  of  Sciences  in  2012  (Chang  et  al., 
2012a). 
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KEY  RESEARCH  ACCOMPLISHMENTS 


•  Development  of  intranasal  oxytocin  (OT)  protocol  in  rhesus  monkeys  with  a  confirmation 
that  the  method  effectively  delivers  OT  to  the  central  nervous  system  (Meeting 
Presentations,  Paper  Published:  Chang  et  al. ,  2012a) 

•  Demonstration  of  OT-induces  changes  in  ORPs  in  rhesus  monkeys  (Presented  at 
multiple  meetings)  (Meeting  Presentations,  Paper  Published:  Chang  et  al.,  2012a) 

•  Recording  of  neuronal  activity  from  the  orbitofrontal  cortex  (OFC)  during  the  ORP  task 
(Meeting  Presentations,  Paper  currently  in  press :  Chang  et  al.,  in  press) 

•  Extension  of  the  current  experiments  to  other  prefrontal  brain  regions  (the  sulcus  and 
gyrus  of  the  anterior  cingulate  cortex)  to  better  understand  how  ORP-related  signals 
differ  across  different  parts  of  the  prefrontal  cortex  (Acquired  new  funding  for  future 
research  [see  REPORTABLE  OUTCOMES]) 
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REPORTABLE  OUTCOMES 


A.  Publications 

1.  Chang  SW,  Gariepy  JF,  and  Platt  ML  (in  press)  Neuronal  reference  frames  for  social 
decisions  in  primate  frontal  cortex.  Nature  Neuroscience. 

2.  Gariepy  JF,  Chang  SW  and  Platt  ML  (in  press)  Brain  games:  Toward  a  neuroecology  of 
social  behavior.  Invited  commentary  in  Beh.  Brain.  Sci.,  in  press. 

3.  Chang  SW,  Barter  JW,  Ebitz  RB,  Watson  KK  and  Platt  ML  (2012a)  Inhaled  oxytocin 
amplifies  both  vicarious  reinforcement  and  self  reinforcement  in  rhesus  macaques 
(Macaca  mulatta).  Proc  Natl  Acad  Sci,  109,  959-964. 

4.  Chang  SW,  Barack  DL  and  Platt  ML  (2012b)  Mechanistic  classification  of  neural  circuit 
dysfunctions:  Insights  from  neuroeconomics  research  in  animals.  Biol.  Psychiatry , 
72:101-106. 


B.  Meeting  Abstracts 

1.  Chang  SW,  Gariepy  JF,  and  Platt  ML.  Differential  encoding  of  social  decision  outcomes 
by  neurons  in  primate  orbitofrontal  cortex,  dorsal  anterior  cingulate  cortex  and  anterior 
cingulate  gyrus.  Society  for  Neuroscience  (New  Orleans,  LA),  2012  (Talk)  &  Contributed 
Talk  at  Society  for  Social  Neuroscience,  2012 

2.  Chang  SW,  Gariepy  JF,  and  Platt  ML.  Neuronal  reference  frames  for  social  decisions  in 
primate  prefrontal  cortex.  Organization  for  Computational  Neuroscience  (Atlanta,  GA), 
2012  (Poster) 

3.  Platt,  ML.  Neuronal  basis  of  giving  and  receiving.  Organization  for  Computational 
Neuroscience  (Atlanta,  GA),  2012,  invited  talk. 

4.  Chang  SW  and  Platt  ML.  Differential  coding  of  egocentric  and  allocentric  reward 
outcomes  during  social  interaction  in  primate  ACC  and  OFC.  Society  for  Neuroscience 
(Washington,  DC),  2011  (Talk) 

5.  Chang  SW,  Barter  JW,  Ebitz  RB,  Watson  KK  and  Platt  ML.  Oxytocin  promotes  prosocial 
decisions  in  rhesus  macaques.  Society  for  Neuroscience  (Washington,  DC),  2011 
(Poster) 

6.  Chang  SW,  Barter  JW,  Ebitz  RB,  Watson  KK  and  Platt  ML.  Inhaled  oxytocin  amplifies 
both  vicarious  reinforcement  and  self  reinforcement  in  rhesus  macaques  (Macaca 
mulatta).  Workshop  on  the  Biology  of  Prosocial  Behavior  at  Emory  University  (Atlanta, 
GA),  2011  (Poster) 


C.  Research  Support  (built  upon  this  award) 
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1.  NIH/NIMH  2/21/12-11/30/16 

R01  MH095894-01  (Platt) 

Neuronal  basis  of  vicarious  reinforcement  dysfunction  in  autism  spectrum  disorder 
The  goal  of  this  project  is  to  understand  the  role  of  prefrontal  cortex  in  mediating 
vicarious  reinforcement  during  reward  allocation  decisions 

2.  Duke  Department  of  Neurobiology  6/01/11-5/31/12 

Postdoctoral  Training  Award  in  Fundamental  &  Translational  Neuroscience 
NIH/NINDS  T32  NS051 156-07  (Chang) 

Neural  basis  of  other-regarding  preference 

The  goal  of  this  project  is  to  understand  the  role  of  anterior  cingulate  cortex 
and  orbitofrontal  cortex  during  reward  allocation  decisions 

3.  NIH/NIMH  Pending 

NIH  K99/R00  Pathway  to  Independence  (Chang) 

Role  of  oxytocin  in  the  amygdala-prefrontal  network  during  social  decision-making 
The  goal  of  this  project  is  to  undergo  extensive  training  in  neuroendocrinology,  and  study 
the  mechanisms  underlying  oxytocin-mediated  neural  processing  across  amygdala  and 
prefrontal  neurons  in  social  decision-making. 


D.  Graduate  student  mentoring 

1.  A  successful  rotation  project  for  a  Duke  Cognitive  Neuroscience  PhD  candidate,  Amy  A. 
Winecoff. 

2.  A  successful  rotation  project  for  a  Duke  Cognitive  Neuroscience  PhD  candidate,  Joseph 
W.  Barter,  resulting  in  a  second  authorship  in  Chang  et  al.,  2012a. 

3.  Mentored  Jean-Francois  Gariepy,  visiting  graduate  student  from  U.  Montreal,  resulting  in 
the  development  of  a  novel  social  interaction  task  in  two  rhesus  monkeys  and  the 
neuronal  recording  of  strategic  interaction  signals  from  the  dorsolateral  prefrontal  cortex. 
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CONCLUSION 


Other-regarding  preferences  (ORPs)  are  critical  for  normal  social  behavior,  and  the 
neural  mechanisms  underlying  ORPs  may  be  disrupted  in  neuropsychiatric  disorders  marked  by 
social  deficits,  including  the  autism  spectrum  disorders  (ASD).  Both  motivation-related 
processing  in  the  brain  and  the  neuropeptide  oxytocin  (OT)  have  been  implicated  in  ASD. 
However,  the  neural  mechanisms  underlying  ORPs  remain  elusive,  partly  due  to  the  lack  of  a 
good  animal  model  for  studying  complex  social  behavior.  To  address  this  gap,  we  developed  a 
novel  social  interaction  task  involving  two  rhesus  macaques,  and  have  investigated  the  role  of 
OFC  neurons,  previously  implicated  in  motivation  and  decision-making,  as  well  as  the 
neuropeptide  OT,  which  has  previously  been  implicated  in  social  preferences,  during  the 
expression  of  ORPs. 

We  found  that  rhesus  monkeys  care  about  what  happens  to  others,  as  indicated  by  their 
preference  to  deliver  juice  rewards  to  a  recipient  monkey  over  no  one  and  the  increased  social 
looking  behavior  associated  with  such  prosocial  decisions.  Furthermore,  prosocial  preferences 
are  enhanced  following  inhalation  of  OT  by  monkeys.  Neuronal  recording  from  OFC  revealed  a 
strong  preference  for  computing  rewards  delivered  to  self  compared  to  rewards  delivered  to 
either  another  monkey  or  no  one,  indicating  that  OFC  neurons  track  self  motivation  during  social 
interactions.  Furthermore,  this  self  reward  preference  in  OFC  was  more  faithfully  encoded 
following  active  decisions  by  donor  monkeys,  consistent  with  a  motivational  function  of  OFC 
with  respect  to  self.  Our  results  begin  to  reveal  how  the  primate  brain  makes  decisions  during 
social  interaction  with  other  individuals. 

OT  has  been  evaluated  for  potential  therapeutic  use  in  clinical  conditions  marked  by 
social  deficits,  such  as  ASD,  antisocial  personality  disorder,  and  schizophrenia.  Notably,  the 
intranasal  nebulization  method  we  developed,  demonstrated  the  efficacy  of,  and  applied  here  is 
well-tolerated  by  children  for  delivery  of  other  therapeutics  (e.g.,  albuterol),  thus  opening  up 
avenues  for  early  OT  intervention  in  neuropsychiatric  conditions  with  social  deficits. 

Our  findings  provide  new  opportunities  for  uncovering  the  neurophysiological  and 
neuroendocrinological  mechanisms  underlying  complex  social  behavior  in  a  species  much  more 
closely  related  to  humans  than  mice  or  rats.  Rhesus  monkeys  have  long  served  as  the  preferred 
model  species  for  probing  the  neural  mechanisms  underlying  complex  cognition.  Given  the 
strong  similarities  in  social  behavior  and  cognition,  together  with  remarkable  homologies  in 
neural  circuitry,  the  rhesus  macaque  provides  a  powerful  model  for  probing  the  neurobiological 
mechanisms  of  social  interactions  in  people. 


Medical  Implications  (“So  What”  section) 

Our  work  holds  promise  both  for  understanding  the  basic  mechanisms  that  support 
complex  social  behavior  and  translating  that  knowledge  into  improved  treatment  for  social 
dysfunction  in  ASD.  In  particular,  our  work  tests  the  idea  that  empathy  derives  from  the 
activation  of  neural  circuits  that  process  primary  emotions  or  feelings,  such  as  reward  or 
punishment,  merely  by  observing  the  same  things  happen  to  other  people.  OT  therapy  for  ASD 
and  other  neuropsychiatric  disorders  is  currently  being  explored  in  clinical  trials,  despite 
uncertainty  regarding  the  exact  mechanism  of  action  in  the  brain  or  the  long-term  consequences 
of  use.  By  testing  this  drug  in  an  animal  model,  we  can  directly  confirm  efficacy,  efficiency,  and 
long-term  safety.  Clinicians  can  use  this  information  to  directly  inform  therapeutic  interventions 
in  ASD.  We  have  already  demonstrated,  for  the  first  time  in  any  species,  that  inhaled  OT  is 
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taken  up  by  the  central  nervous  system — an  important  prerequisite  for  exploring  further  clinical 
opportunities. 

Our  work  promises  ancillary  benefits  as  well.  Our  findings  regarding  the  functional  role  of 
the  OFC  will  also  be  of  use  in  clinical  contexts.  The  precise  way  OFC  contributes  to  social 
behavior  remains  a  mystery.  By  delineating  the  neuronal  properties  of  neurons  in  OFC,  we  can 
further  understand  how  this  brain  area  contributes  to  the  expression  of  complex  social  behavior 
as  well  as  its  dysfunction.  Such  insights  may  prove  invaluable  in  the  diagnosis  and  treatment  of 
social  behavioral  disorders  that  accompany  head  trauma,  with  potentially  important  implications 
for  veterans  of  US  armed  forces  returning  from  the  battlefield  suffering  from  traumatic  brain 
injuries  and  attendant  problems  in  adjusting  to  civilian  life  and  society. 

Ultimately,  the  results  of  these  experiments  will  inform  interventions  for  social  disorders, 
on  both  the  pharmacological  and  behavioral  levels,  and  significantly  improve  the  lives  of  people 
living  with  ASD  and  other  individuals  struggling  with  normal  social  life. 
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SUPPORTING  DATA  (1  Table  and  5  Figures) 


Table  1 


Table  1.  Classification  of  the  reward  type,  trial  type,  and  reward  size  selectivities 
at  the  level  of  individual  neurons  from  OFC  based  on  analysis  of  variance. 


Area 

Proportion  of 
significant  neurons 
by  factors 
(reward  epoch) 

Proportion  of 
significant  neurons 
between  different  rewards 
(reward  epoch) 

Proportion  of 
reference  frame  types 
(reward  or  choice  epoch) 

OFC 

Reward  Outcome 

57%  (77=85) 

Self  vs.  Neither  37%  (n= 85) 

SELF  frame 

38%  (77=85) 

Trial  Type 

45%  (77=85) 

of  reference 

[78%] 

Reward  Volume 

24%  (77=62) 

Self  vs.  Other  42%  (n= 85) 

Reward  Outcome  x 

37%  (n=85) 

OTHER  frame 

5%  (77=85) 

Trial  Type 

Other  vs.  Neither  1 4%  (n= 85) 

of  refernece 

[10%] 

Reward  Volume  x 

10%  (77=62) 

Reward  Outcome 

Self  (Self .Other)  1 3%  (77=85) 

BOTH  frame 

6%  (77=85) 

Reward  Volume  x 

10%  (;t=62) 

vs.  Self  (Self:Neither) 

of  reference 

[1 2%] 

Trial  Type 

(Mirror) 

Reward  Outcome  x 

11%  (77=62) 

Trial  Type  x 

Unclassified 

51%  (77=85) 

Reward  Volume 

The  percentages  shown  inside  the  brackets  on  the  4th  column  show  the  proportions  out  of  classfied  neurons.  Significance  in 
all  panels  was  based  on  P<  0.05  (analysis  of  variance  and  tukey  HSD  tests). 
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Reward  On 


Color  schems  for  PSTHs 


,  Self  reward  (Self:Other) 

,  Self  reward  (Self:Neither) 
,  Other  reward 
i  Neither  reward 
>  Cued  Self  reward 
'  Cued  Other  reward 
i  Cued  Neither  reward 


OFC 


Relative  time  (sec) 
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Figure  1.  OFC  recording  sites,  and  single  and  population  OFC  responses  during  the  ORP  task, 
(a)  Structural  magnetic  resonance  image  (MRI)  from  donor  MO,  with  example  electrode  paths 
shown  for  OFC.  (b)  Mean  responses  (peri-stimulus  time  histograms  [PSTHs])  and  spike  rasters 
for  a  self  reward  preferring  OFC  neuron.  Choice  trials:  upper,  solid  traces.  Cued  trials:  lower, 
dashed  traces.  Data  are  aligned  to  reward  onset  for  each  reward  outcome.  Histograms  on  right 
show  mean  ±  s.e.m.  activity  from  reward  epoch  (grey  box).  Color  codes  for  PSTH  traces  and 
histograms  are  shown  below,  (c)  Normalized  reward  epoch  responses  for  85  OFC  neurons. 
Same  format  as  in  b.  In  all  histograms,  the  horizontal  lines  above  different  conditions  indicate 
significance  differences  (solid,  P  <  0.05  by  paired  f-test;  dashed,  P  <  0.05  by  bootstrap  test). 
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Figure  2.  Population  biases  for  self ,  other,  and  neither  reward  outcomes  in  OFC  neurons,  (a) 
Scatter  plots  show  mean  normalized  reward  epoch  responses  of  individual  neurons  between 
self  and  other  rewards  (left),  between  other  and  neither  rewards  (middle),  and  between  self 
rewards  from  Self:Neither  and  Self:Other  contexts  (right).  The  example  neuron  from  Fig.  1b  is 
indicated  on  the  scatter  plots,  (b)  Proportion  of  neurons  (out  of  significantly  classified  neurons 
according  to  ANOVA)  from  OFC  using  self-referenced,  other-referenced,  and  both-referenced 
(mirror)  frames  for  representing  reward  outcomes.  Inset  shows  color  codes  used  in  the  bar 
graph.  The  asterisk  on  the  bars  indicates  significant  differences  in  proportions  (P  <  0.05,  x2 
test). 
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Figure  3  Time  courses  of  the  OFC  population  responses  aligned  to  the  time  of  choice  onset  and 
the  time  of  making  a  choice  for  choice  trials,  and  aligned  to  the  time  of  cue  onset  and  cue  offset 
for  cued  trials.  Normalized  mean  responses  (PSTHs)  of  85  OFC  neurons  are  plotted  over  time 
for  each  alignment.  The  inset  shows  the  color  scheme.  Same  format  as  in  Fig.  1b,c. 
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Figure  4  Differences  in  coefficient  of  variation  (CV)  across  different  reward  outcomes  also 
reflect  self  reward  bias  in  OFC.  Plotted  are  differences  in  CV  between  a  pair  of  reward 
categories  (as  indicated  on  the  right  of  each  distribution).  We  compared  individual  neuron 
averages  of  all  trials  in  which  the  donors  received  rewards  against  all  trials  in  which  the  donors 
did  not  receive  rewards  ( Received  -Omitted)  (top).  We  also  compared  individual  neuron 
averages  of  trials  in  which  the  recipient  received  the  rewards  against  trials  in  which  no  one 
received  rewards  ( Other  -  Neither)  (middle),  and,  finally,  between  trials  in  which  the  donors 
received  rewards  in  Self:Neither  against  Self:Other  contexts  (Self  (Self:Neither)  -  Self 
(Self:Other))  (bottom).  If  applicable,  the  data  were  collapsed  across  choice  and  cued  trials  for 
this  analysis.  Data  points  are  jittered  in  the  vertical  dimensions  for  visibility.  The  asterisk  above 
the  data  points  indicates  significance  (*:  P  <  0.10,  one  sample  f-test)  in  the  distribution. 
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Figure  5  Reward  coding  is  not  simply  driven  by  gaze  shifts  directed  at  the  recipient.  Shown  are 
histograms  of  the  differences  in  normalized  reward  epoch  responses  between  trials  with  gaze 
shifts  and  without  gaze  shifts  (responses  ‘with’  -  responses  ‘without’  gaze  shifts),  for  trials  in 
which  rewards  were  delivered  to  self  (top),  other  (middle),  or  neither  (bottom),  for  OFC 
populations.  Arrows  indicate  distribution  means. 
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Statement  of  Work  (SOW) 


All  monkey  experimental  sessions  will  take  place  in  Dr.  Michael  Platt’s  dedicated 
laboratory  space  in  Duke  University’s  Vivarium  facility.  Data  analysis  will  be  conducted 
in  Duke  University’s  Levine  Science  Research  Center. 

Platt  Laboratory 

Center  for  Cognitive  Neuroscience 
Duke  University 
Box  90999 
Durham,  NC  27708 

The  laboratory’s  current  approved  protocol  allows  for  all  experiments  proposed  in  this 
application,  so  no  period  for  regulatory  review  is  necessary. 

MATLAB  scripts  for  task  presentation  and  behavioral  analysis  were  developed  prior  to 
the  award  period,  during  the  collection  of  preliminary  data. 


Timeline: 

Proposed  start  date:  07/01/201 1 
Proposed  end  date:  06/30/2014 

Pre-Task.  USAMRMC  ROP  IACUC  documentation  (7/2011  -  10/2011) 

Task  1.  Characterize  neural  responses  in  the  orbitofrontal  cortex  (OFC)  (4 
monkeys)  (1 0/201 1  -11/2012) 

la.  Record  cells  in  OFC  while  monkeys  perform  the  other-regarding  preference 
(ORP)  task  (10/201 1  -  3/2012) 

lb.  Analyze  recording  data  (3/2012  -  7/2012) 

a)  Characterize  activity 

b)  Determine  response  epochs  for  primary  analyses 

c)  Correlate  neural  activity  with  behavior 

lc.  Prepare  &  submit  manuscript  on  OFC  recording  data  (7/2012  -  1 1/2012) 

Task  2.  Characterize  behavior  after  muscimol  inactivation  (4  monkeys)  (1/2013  - 
9/2013) 

2a.  Administer  muscimol  during  task  performance  (1/2013  -  4/2013) 

2b.  Analyze  behavioral  data  (4/2013  -6/2013) 

a)  Determine  the  effect  of  muscimol  on  behavioral  choices 

b)  Determine  the  effect  of  muscimol  on  gaze  patterns 

2c.  Prepare  &  submit  manuscript  on  OFC  inactivation  (6/2013  -  9/2013) 

Task  3.  Determine  behavioral  response  to  microstimulation  (4  monkeys)  (9/2013 
-6/2014) 

3a.  Stimulate  OFC  during  task  performance  (9/2013  -  2/2014) 


3b. 


Analyze  behavioral  data  (12/2013  -  3/2014) 

a)  Determine  the  effect  of  microstimulation  on  behavioral  choices 

b)  Determine  the  effect  of  microstimulation  on  gaze  patterns 

c)  Determine  the  level  of  perturbation  and  time-dependency 

3c.  Prepare  &  submit  manuscript  on  OFC  microstimulation  (3/2014  -  6/2014) 

Task  4.  Examine  the  effect  of  oxytocin  (OT)  on  task  performance  (4  monkeys) 
(7/2011  -2/2012) 

4a.  Deliver  OT  during  task  performance  (7/201 1  -  12/2011) 

4b.  Analyze  behavioral  data  (9/201 1  -  1 2/201 1 ) 

a)  Determine  the  effect  of  OT  on  behavioral  choices 

b)  Determine  the  effect  of  OT  on  gaze  patterns 

c)  Examine  the  time  course  of  the  OT  effect 

4c.  Prepare  &  submit  manuscript  on  OT  effect  (10/201 1  -  2/2012) 


Inhaled  oxytocin  amplifies  both  vicarious 
reinforcement  and  self  reinforcement  in 
rhesus  macaques  (Macaca  mulatta) 

Steve  W.  C.  Chang3,6,1,  Joseph  W.  Barter6,  R.  Becket  Ebitz3,6,  Karli  K.  Watson3,6,  and  Michael  L.  Platt3,b,c,d 

department  of  Neurobiology,  Duke  University  School  of  Medicine,  Durham,  NC  27701;  and  bCenter  for  Cognitive  Neuroscience  and  Departments  of 
'Evolutionary  Anthropology  and  dPsychology  and  Neuroscience,  Duke  University,  Durham,  NC  27708 

Edited  by  Bruce  S.  McEwen,  The  Rockefeller  University,  New  York,  NY,  and  approved  December  5,  2011  (received  for  review  September  6,  2011) 


People  attend  not  only  to  their  own  experiences,  but  also  to  the 
experiences  of  those  around  them.  Such  social  awareness  profoundly 
influences  human  behavior  by  enabling  observational  learning,  as 
well  as  by  motivating  cooperation,  charity,  empathy,  and  spite. 
Oxytocin  (OT),  a  neurosecretory  hormone  synthesized  by  hypotha¬ 
lamic  neurons  in  the  mammalian  brain,  can  enhance  affiliation  or 
boost  exclusion  in  different  species  in  distinct  contexts,  belying  any 
simple  mechanistic  neural  model.  Here  we  show  that  inhaled  OT 
penetrates  the  CNS  and  subsequently  enhances  the  sensitivity  of 
rhesus  macaques  to  rewards  occurring  to  others  as  well  as  them¬ 
selves.  Roughly  2  h  after  inhaling  OT,  monkeys  increased  the 
frequency  of  prosocial  choices  associated  with  reward  to  another 
monkey  when  the  alternative  was  to  reward  no  one.  OT  also 
increased  attention  to  the  recipient  monkey  as  well  as  the  time  it 
took  to  render  such  a  decision.  In  contrast,  within  the  first  2  h 
following  inhalation,  OT  increased  selfish  choices  associated  with 
delivery  of  reward  to  self  over  a  reward  to  the  other  monkey, 
without  affecting  attention  or  decision  latency.  Despite  the  differ¬ 
ences  in  species  typical  social  behavior,  exogenous,  inhaled  OT 
causally  promotes  social  donation  behavior  in  rhesus  monkeys,  as 
it  does  in  more  egalitarian  and  monogamous  ones,  like  prairie  voles 
and  humans,  when  there  is  no  perceived  cost  to  self.  These  findings 
potentially  implicate  shared  neural  mechanisms. 

social  decision-making  |  neuropeptide  |  other-regarding  preference 
social  gaze 

Oxytocin  (OT)  (1)  is  a  mammalian  neurosecretory  hormone, 
synthesized  by  hypothalamic  neurons,  which  regulates  the 
hypothalamic-pituitary-adrenal  axis  (2).  The  most  well-under- 
stood  role  of  OT  in  mammals  is  in  female  reproduction,  with 
peripheral  OT  influencing  parturition  and  lactation  (3),  and 
central  OT  affecting  mother-offspring  bonding  and  recognition 
(4,  5).  More  recently,  OT  has  been  found  to  influence  non- 
parental  social  behavior  in  a  species-specific  manner.  For  ex¬ 
ample,  OT  promotes  pair-bonding  between  males  and  females  in 
monogamous  prairie  voles  ( Microtus  ochrogaster)  (6,  7)  but  can 
also  increase  aggression  (i.e.,  mate -guarding  behavior)  and  de¬ 
crease  social  interaction  among  females  after  brief  exposure  to 
a  male  (8).  In  humans,  OT  also  influences  more  complex  forms 
of  social  behavior  and  cognition  (9-14).  For  example,  inhaled 
OT  enhances  trusting  behavior  toward  other  individuals  in  eco¬ 
nomic  games,  potentially  by  suppressing  aversion  to  betrayal  risk 
(15),  and  promotes  cooperation  within  groups  (16).  However, 
inhaled  OT  also  provokes  cultural  and  racial  biases  (17).  OT 
inhalation  also  enhances  sensitivity  to  the  experiences  of  others 
by  promoting  vicarious  reward  and  empathic  pain  (10,  18,  19). 
Recently,  OT-mediated  processes  have  been  implicated  in  dis¬ 
orders  attended  by  dysfunctional  social  behavior,  including  au¬ 
tism,  fragile  X  syndrome,  and  schizophrenia  (19-22).  Notably, 
OT  treatment  improves  social  skills  in  individuals  with  autism 
(21,  23,  24),  a  spectrum  of  disorders  with  marked  deficits  in 
sensitivity  to  what  happens  to  others,  including  impairments 
in  understanding  and  responding  to  social  cues  (22,  25,  26). 


Variations  in  a  common  oxytocin-receptor  allele  are  linked  to 
autism  spectrum  disorders  and  are  associated  with  reduced  vol¬ 
ume  in  hypothalamus  and  anterior  cingulate  cortex  (27). 

Despite  a  growing  literature,  the  mechanisms  mediating  the 
influence  of  OT  on  sensitivity  to  what  happens  to  others  remain 
only  partially  understood  (9,  14,  19,  21,  28,  29).  OT  receptors  are 
localized  in  multiple  regions  of  the  brain,  with  especially  high 
density  in  areas  implicated  in  affective  and  social  processing.  In 
prairie  voles,  OT  receptors  are  densely  localized  in  the  amygdala, 
prelimbic  cortex  (homologous  to  the  cingulate  cortex  in  pri¬ 
mates),  and  nucleus  accumbens  of  the  striatum  (30).  Recently,  it 
has  been  shown  that  OT  selectively  inhibits  a  dedicated  channel 
from  the  central  nucleus  of  the  amygdala  to  periaqueductal  gray, 
ultimately  reducing  fear-induced  freezing  behavior  in  rats  (31). 
Similarly,  in  humans,  inhaled  OT  influences  on  social  behavior 
are  associated  with  reduced  blood  oxygen  level-dependent  (BOLD) 
signals  in  the  bilateral  amygdala  and  dorsal  striatum  (28,  29), 
consistent  with  the  OT-mediated  negative  affect  processing  in 
the  amygdala-cingulate  circuits  (22).  These  studies  provide  evi¬ 
dence  that  OT  influences  information  processing  in  neural  cir¬ 
cuits  implicated  in  emotion  and  social  behavior. 

Unlike  prairie  voles  or  humans  (2,  6,  9-11,  13-16,  30,  32,  33), 
rhesus  macaques  ( Macaca  mulatta)  live  in  large,  hierarchical 
social  groups  with  promiscuous  mating  and  uniparental  female 
care  of  offspring.  Precisely  how  OT  might  influence  social  cog¬ 
nition  in  animals  with  this  type  of  social  structure  and  mating 
system,  if  at  all,  remains  unknown.  To  answer  this  question,  we 
capitalized  on  a  recent  finding  by  our  group  showing  that  rhesus 
macaques  are  sensitive  to  the  rewards  experienced  by  others,  and 
this  vicarious  reinforcement  is  sufficient  to  motivate  them  to 
work  to  reward  another  monkey  when  the  alternative  is  de¬ 
livering  reward  to  no  one  (34).  We  found  that  inhaling  OT  in¬ 
creased  OT  levels  in  cerebral  spinal  fluid  (CSF),  demonstrating 
transnasal  penetration  into  the  CNS.  Roughly  2  h  after  OT-in- 
halation  and  onward,  donor  monkeys  selectively  increased  the 
frequency  of  choosing  an  option  resulting  in  reward  to  an  adja¬ 
cent,  visible  monkey,  when  the  alternative  was  rewarding  no  one. 
In  the  same  context,  OT  also  increased  the  frequency  that 
donors  looked  at  the  recipient  monkey  and  prolonged  choice 
response  times.  In  contrast,  up  to  about  2  h  postinhalation,  OT 
increased  selfish  decisions  when  the  donors  had  the  option  to 
reward  self  over  the  other  monkey.  These  findings  invite  the 
hypothesis  that  OT  boosts  internal  vicarious  reinforcement  sig¬ 
nals  in  a  context-dependent  manner  in  neural  circuits  homolo- 
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gous  to  those  mediating  these  processes  in  humans.  Our  results 
demonstrate  that  OT  mediates  other-regarding  behavior  in  non¬ 
human  animals,  even  in  those  living  in  despotic  societies  with 
uniparental  care. 

Results 

Donor  monkeys  (hereafter,  “self’  or  “donor”)  performed  a  re¬ 
ward  allocation  task  with  an  unrelated  recipient  monkey 
(“other”)  (Fig.  1  A-C )  (34).  The  two  monkeys  were  seated  in 
adjacent  primate  chairs  (Crist),  100-cm  apart  and  at  45°  angles  to 
each  other.  Each  monkey  viewed  his  own  LCD  display,  and  had 
a  juice-tube  positioned  in  front  of  his  mouth  through  which  re¬ 
ward  could  be  delivered.  On  each  trial,  donors  chose  between 
two  visual  shapes,  associated  with  rewarding  self,  other,  or  nei¬ 
ther.  We  have  previously  shown  that  donors  typically  prefer  the 
shape  delivering  reward  to  other  over  neither  (34).  This  prefer¬ 
ence  is  enhanced  by  greater  familiarity  between  the  two  mon¬ 
keys,  and  is  abolished  if  the  recipient  monkey  is  replaced  with 
a  juice  collection  bottle,  thus  demonstrating  the  fundamentally 
social  nature  of  the  task  (34). 

For  each  session,  we  intranasally  (35)  delivered  25  international 
units  (IU)  of  OT  or  saline,  on  alternating  days,  to  two  males  using 
a  pediatric  nebulizer  30  min  before  performing  the  reward  alloca¬ 
tion  task.  A  session  composed  of  multiple  reward  allocation  trials 
after  either  OT  or  saline  administration  occurred  on  each  day 
(Methods).  Data  from  a  total  of  12  OT  and  10  saline  control  ses¬ 
sions  were  collected  from  two  donors  (MY  and  MO)  while  they 


engaged  in  the  reward  allocation  task  (Fig.  1  A-C)  with  an  un¬ 
related  recipient  monkey  (MD).  Five  OT  and  three  saline  sessions 
were  collected  from  MY,  and  seven  OT  and  saline  sessions  each 
were  collected  from  MO.  For  statistical  power,  we  present  data 
collapsed  across  the  two  donors,  unless  otherwise  stated. 

OT  inhalation,  compared  with  saline,  significantly  increased 
OT  concentration  in  CSF  as  measured  by  cervical  draws  (P  < 
0.05,  Welch  two-sample  t  test)  (Fig.  ID),  confirming  transnasal 
penetration  into  the  CNS.  Thirty  minutes  after  OT  administra¬ 
tion,  donors  began  the  reward  allocation  task.  For  choices  be¬ 
tween  delivering  reward  to  other  and  neither,  OT  selectively 
amplified  reward  donations  to  other  (Fig.  2).  Preference  for 
other  increased  linearly  over  time  after  OT  but  not  after  saline 
(OT:  different  from  0,  r2  =  0.26,  P  <  0.0005;  saline:  r2  =  0.01, 
P  =  0.47,  linear  regression)  (Fig.  2).  OT-induced  enhancement 
of  prosocial  choices  was  largest  in  the  later  half  of  a  given  session 
(i.e.,  — 1 10  min  after  OT  administration  and  ~80  min  after  task 
initiation;  preference  index  mean  difference  between  OT  vs. 
saline:  0.17,  P  <  0.00001,  Welch  two-sample  t  test)  (Fig.  2).  In¬ 
dividual  donors  showed  a  similar  pattern  (MY:  0.18,  P  <  0.00001; 
MO:  0.19,  P  <  0.01).  We  found  a  significant  difference  between 
the  two  treatment  conditions  even  when  we  averaged  across  the 
entire  duration  of  the  task  (mean  difference  of  0.12,  P  <  0.00001; 
MY:  0.15,  P  <  0.00001;  MO:  0.06,  P  <  0.05,  Welch  two-sample 
t  test). 

In  contrast,  in  the  early  half  of  a  given  session  (i.e.,  up  to  ~80 
min  into  the  task),  OT  slightly  but  significantly  increased  selfish 
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Fig.  1.  Reward  allocation  task.  (A)  Experimental  setup.  ( B )  Trial  sequence.  Choice  (Upper)  and  cued  (Lower)  trials  were  randomly  interleaved.  The  eye-gaze 
cartoons  specify  the  task  intervals  during  which  the  donors  could  potentially  look  at  the  recipient  monkey.  MT,  movement  time;  RT,  reaction  time.  (C)  Stimuli 
associated  with  different  reward  outcomes  to  donors  and  recipient,  shown  separately  for  the  two  donors.  (D)  OT  concentration  in  the  CSF  after  intranasal  OT 
(in  red)  or  saline  (dark  gray).  *P  <  0.05,  Welch  two-sample  t  test.  Colored  outlines  on  the  datapoints  represent  animal  identities. 
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Fig.  2.  Intranasal  OT  promotes  both  vicarious  and  self  reinforcement. 
Choice  preference  index  (moving  averages  of  200  trials  per  session,  50-trial 
step)  for  OT  (red)  and  saline  (gray)  across  all  reward  options  (other  vs.  nei¬ 
ther,  self  vs.  other,  and  self  vs.  neither).  Datapoints  from  self  vs.  other  and 
self  vs.  neither  are  jittered  along  the  ordinate  for  visibility.  (.Inset)  Unjittered 
and  magnified  data  from  self  vs.  other  trials.  Data  from  self  vs.  neither  trials 
were  effectively  overlapping  between  the  OT  and  saline  conditions,  and 
therefore  not  shown  in  an  unjittered  format.  OT,  12  sessions;  saline,  10 
sessions.  Lines  show  linear  regression  on  other  vs.  neither  trials. 


choices  on  self  vs.  other  trials  compared  with  saline  control 
(mean  difference  between  OT  and  saline  of  -0.02,  P  <  0.00001, 
Welch  two-sample  t  test;  Inset  in  Fig.  2  shows  unjittered  self  vs. 
other  trials),  but  had  no  effect  on  self  vs.  neither  trials  (mean 
difference  of  -0.002,  P  =  0.36).  Individual  donors  showed 
a  similar  selfish  bias  (MY:  -0.003,  P  <  0.06;  MO:  -0.04,  P  < 
0.00001).  The  absence  of  OT  effect  on  self  vs.  neither  trials  might 
be  due  to  the  fact  that  this  context  does  not  involve  a  potential 
reward  to  another  monkey,  although  we  cannot  rule  out  the 
possibility  that  donors  were  maximally  self-regarding  in  this 
context  in  the  absence  of  OT.  Thus,  OT  robustly  enhanced 
prosocial  choices  when  there  was  no  potential  cost  to  self,  but 
slightly  increased  selfish  choices  when  there  was  potential  for 
direct  self  reward. 

Donor  monkeys  often  shift  gaze  to  the  recipient  monkey  after 
making  a  choice,  and  this  attention  to  the  recipient  is  enhanced 
after  prosocial  choices  compared  with  selfish  choices  (34).  OT 
further  enhanced  this  overt  other-oriented  attention  to  the  re¬ 
cipient  after  donors  made  a  decision  on  other  vs.  neither  trials 
(Fig.  3A)  (OT  vs.  saline:  mean  difference  of  4.70%,  P  <  0.05, 
Welch  two-sample  t  test).  In  contrast,  we  did  not  observe  any 
effects  of  OT  on  donor’s  attention  to  the  recipient  when  direct 
self  reward  was  involved  (self  vs.  neither:  mean  difference  of 
-0.36%,  P  =  0.95;  self  vs.  other:  0.03%,  P  =  0.99)  (Fig.  3A).  We 
also  found  that  donors  looked  more  frequently  to  the  recipient 
when  rewards  were  delivered  to  him  compared  with  when 
rewards  were  delivered  to  self,  even  on  cued  trials  in  which 
rewards  were  delivered  by  computer  without  any  action  by 
donors  (gaze  frequency  on  self-cued  vs.  other-cued  trials:  OT, 
P  <  0.005;  saline:  P  =  0.05)  (Fig.  3/1).  However,  OT  did  not 
modulate  this  difference  in  social  attention  on  cued  trials  (all 
comparisons  P  >  0.23,  Welch  two-sample  t  test)  (Fig.  3/1),  sug¬ 
gesting  that  OT  enhances  other-oriented  attention  selectively 
following  prosocial  decisions  rather  than  in  response  to  anything 
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Fig.  3.  Intranasal  OT  enhances  attention  to  the  recipient  monkey  and 
increases  the  deliberation  time  for  making  donation  decisions.  (A)  Gaze  to 
the  face  of  the  other  monkey  after  reward  delivery.  (Left)  Percentages  of 
gaze  shifts  to  the  recipient  monkey  on  choice  trials  (Upper)  and  cued  trials 
(Lower).  (Right)  Number  of  gaze  shifts  over  the  course  of  each  day  session 
for  other  vs.  neither  choice  trials  (moving  averages  of  200  trials  per  session, 
50-trial  step).  Lines  through  the  datapoints  show  linear  regressions.  (6)  Re¬ 
sponse  times,  measured  as  saccade  onset  times  following  target  onset  (ms). 
(C)  OT  reduced  choice  avoidance  [i.e.,  declining  to  choose  by  breaking  fix¬ 
ation  upon  target  onset  (such  as,  reward  options),  which,  in  the  task  resulted 
in  a  time  out  for  5  sj.  *P  <  0.05,  Welch  two-sample  t  test. 


happening  to  the  other  monkey  (i.e.,  after  active  choices  on 
other  vs.  neither  trials).  As  in  the  other-oriented  choice  prefer¬ 
ence,  attention  to  the  recipient  monkey  also  increased  linearly 
over  time  after  OT  (slope  significantly  different  from  0:  r  = 
0.31,  P  <  0.00001,  linear  regression)  (Fig.  3 A,  Right).  The  fre¬ 
quency  of  looking  at  the  recipient  monkey  in  the  saline  control 
also  increased  over  the  course  of  the  session  (r  =  0.19,  P  < 
0.005),  but  with  a  significantly  lower  rate  of  rise  than  the  OT 
condition  (differences  in  OT  and  saline  slopes  greater  than  zero: 
P  <  0.005,  permutation  test)  (Fig.  3 A).  This  finding  suggests  that 
OT  enhances  the  intensity  of  vicarious  reinforcement  in  part  by 
modulating  attentional  mechanisms. 

We  also  examined  the  time  required  by  monkeys  to  render 
a  decision.  Response  times  in  the  reward  allocation  task  are 
generally  slower  when  donor  monkeys  choose  between  delivering 
reward  to  other  vs.  neither,  compared  with  when  self  reward  is 
involved  (34).  OT  selectively  prolonged  response  times  on  other 
vs.  neither  trials  (mean  difference  between  OT  and  saline  of  26.0 
ms,  P  <  0.00001,  Welch  two-sample  t  test)  (Fig.  3 B),  possibly 
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reflecting  internal  processes,  such  as  deliberation  and  control. 
On  self  vs.  neither  and  self  vs.  other  trials,  however,  OT  only 
showed  a  trend  on  response  times  (self  vs.  other:  mean  differ¬ 
ence  of  14.78  ms;  self  vs.  neither:  8.72  ms;  both  P  <  0.13)  (Fig. 
3 B).  Finally,  on  some  trials,  donors  avoided  making  a  decision, 
opting  to  wait  until  the  next  trial  (although  they  could  not  predict 
the  subsequent  reward  options).  OT  reduced  this  choice  avoid¬ 
ance  behavior  across  all  trial  types  (all  P  <  0.05,  Welch  two- 
sample  t  test)  (Fig.  3C),  perhaps  because  of  overall  enhancement 
in  subjective  reinforcement. 

Inhaled  OT  thus  influenced  reward  donation  decisions  by 
rhesus  macaques  when  there  was  an  option  to  reward  another 
monkey  (other  vs.  neither  and  self  vs.  other,  but  not  self  vs.  nei¬ 
ther).  OT  enhanced  reward  donations  on  other  vs.  neither  trials, 
but  increased  selfish  behavior  on  self  vs.  other  trials  (Fig.  2). 
OT-induced  changes  in  attention  to  the  recipient  monkey  (Fig. 
3 A)  and  decision  time  (Fig.  3 B)  were  both  specific  to  the  donation 
context  (other  vs.  neither),  whereas  OT-induced  reductions  in 
choice  avoidance  behavior  (Fig.  3C)  were  global. 

Discussion 

Compared  with  some  other  nonhuman  primates,  social  behavior 
of  rhesus  monkeys  is  primarily  characterized  by  competition  and 
aggression,  and  shows  very  weak,  if  any,  inclination  toward  co¬ 
operation  (36,  37).  In  a  prior  study,  different  levels  of  endoge¬ 
nous  OT  were  reported  in  more  socially  affiliative  mother-reared 
compared  with  more  socially  agnostic  nursery-reared  macaques 
(38).  Here  we  show  that  exogenous  OT  promotes  social  donation 
behavior  in  rhesus  macaques,  as  it  does  in  more  egalitarian  and 
monogamous  species,  like  prairie  voles  and  humans.  OT-induced 
prosocial  donations  were  accompanied  by  enhanced  other-ori¬ 
ented  attention  and  decision  times.  In  contrast,  in  a  context  in 
which  there  was  a  potential  for  rewarding  self  or  another  mon¬ 
key,  OT  slightly  increased  the  tendency  for  donors  to  choose 
selfishly  without  influencing  overt  attention  and,  at  most,  mini¬ 
mally  affecting  decision  times.  The  absence  of  OT-induced  en¬ 
hancement  of  overt  attention  on  these  trials  suggests  that  OT 
modulates  other-oriented  preferences  through  vicarious  re¬ 
inforcement  (34).  These  findings  are  consistent  with  context- 
dependent  effects  of  OT  on  human  social  behavior  (16,  17,  39) 
(for  a  review  of  human  social  processing,  see  ref.  40),  implying 
similar  neural  mechanisms. 

Given  the  context-specific  increase  in  attention  to  the  other 
monkey  and  more  deliberative  decision  latency,  it  is  conceivable 
that  these  behaviors  are  related.  Several  hypotheses  are  plausible. 
On  the  one  hand,  OT  may  increase  attention  to  the  other  monkey 
via  neural  circuits  mediating  orienting  behavior,  including  amyg¬ 
dala,  parietal  cortex,  and  superior  colliculus.  Increased  attention  to 
the  recipient  may  enhance  vicarious  reinforcement  experienced 
from  delivering  juice  to  him.  Alternatively,  OT  may  influence 
neural  circuits  involved  in  decision-making,  including  the  striatum 
and  anterior  cingulate  cortex  (see  introductory  paragraphs). 
Slowed  response  times  may  reflect  more  deliberate  processing  of 
the  potential  outcomes  available  (41).  A  future  study  designed  to 
probe  the  temporal  evolution  of  OT-induced  effects  on  attention 
and  decision-making  will  be  needed  to  resolve  these  hypotheses. 

The  direction  of  OT-induced  social  enhancement  also  appears  to 
vary  as  a  function  of  time.  OT  initially  enhanced  self  reinforcement 
but  later  amplified  vicarious  reinforcement,  although  the  largest 
OT-induced  effects  were  prosocial.  Although  this  interaction  be¬ 
tween  time-dependent  and  context-dependent  effects  of  OT  may 
be  specific  to  our  reward  allocation  task  and  thus  can  only  be  ex¬ 
trapolated  with  caution,  these  results  suggest  that  OT  may  in¬ 
fluence  self-  and  other-regarding  behaviors  via  distinct  underlying 
neural  mechanisms. 

Why  might  OT  promote  self  reinforcement  bias  on  self  vs.  other 
but  not  on  self  vs.  neither  trials?  The  key  difference  between  the 
two  contexts  is  the  alternative  option.  In  one  context,  the  alter¬ 


native  option  has  a  social  consequence  (i.e.,  rewarding  the  re¬ 
cipient),  whereas  in  the  other  context,  the  alternative  option  does 
not  (i.e.,  nothing  happens  to  either  donor  or  recipient).  OT-in- 
duced  self  reinforcement  may  depend  on  the  contrast  between 
rewarding  self  and  another  individual.  We  hypothesize  that  when 
a  decision  context  presents  this  contrast,  OT  can  promote  selfish 
behavior.  OT  influences  on  self  and  vicarious  reinforcement  (16, 
17,  39)  thus  appear  to  depend  on  the  social  state  of  the  underlying 
neural  circuits. 

Previous  studies  in  monogamous  prairie  voles  and  promiscuous 
montane  voles  ( Microtus  montanus)  have  suggested  that  mating 
system  may  be  a  key  predictor  of  OT  influences  on  social  behavior 
through  the  topology  of  OT  receptor  localization  in  neural  cir¬ 
cuits,  mediating  reinforcement  and  motivation  (33).  A  more 
general  difference  between  prairie  voles  and  montane  voles  is  the 
frequency  and  intensity  of  social  interaction  (33).  Compared  with 
montane  voles,  prairie  voles  are  biparental,  show  more  selective 
aggression,  and  spend  more  time  in  close  physical  proximity  (33). 
Humans  and  rhesus  macaques,  too,  are  highly  social  mammals; 
intranasal  OT  induces  prosocial  tendencies  in  humans  (15,  16) 
and,  as  we  now  report,  in  rhesus  macaques.  These  findings  suggest 
that  OT  may  play  a  critical  role  in  modulating  social  behavior  in 
highly  gregarious  mammals,  regardless  of  mating  system  or  pa¬ 
rental  care  strategy. 

Intranasal  administration  of  OT  in  humans  has  also  been 
shown  to  increase  gaze  to  the  eyes  of  others  (19).  We  found  that 
OT  enhanced  gaze  directed  at  the  face  of  the  other  monkey 
following  active  social  decision-making  but  not  following  passive 
reward  delivery.  This  finding  invites  the  possibility  that  OT  gates 
the  activity  of  attention  circuits  in  the  brain  specifically  during 
active  interaction  with  others.  Evidence  from  human  functional 
neuroimaging  studies  is  consistent  with  this  idea.  For  example, 
OT  selectively  modulates  BOLD  signal  in  the  anterior  cingulate 
cortex,  amygdala,  midbrain,  and  dorsal  striatum  during  a  trust 
game  involving  other  human  players,  but  not  during  a  nonsocial 
decision-making  task  (29).  Functional  connectivity  between  the 
amygdala  and  midbrain  structures  is  also  reduced  by  OT  when 
human  participants  view  emotional  faces  (28).  Finally,  OT 
reduces  the  subjective  evaluation  of  aversively  conditioned  faces, 
and  this  reduction  is  accompanied  by  suppressed  BOLD 
responses  in  the  amygdala  and  the  fusiform  gyrus  (42). 

Consistent  with  our  results,  OT  modulates  deliberation  times 
during  social  decision-making  in  humans.  For  example,  OT  slows 
overall  evaluation  time  for  rating  faces  in  a  nonspecific  manner, 
regardless  of  whether  the  images  were  aversively  conditioned  or 
not  (42).  OT  can  also  speed  up  decision  times;  for  example,  OT 
decreased  overall  key  press  reaction  times  for  evaluating  in¬ 
group  favoritism  and  out-group  derogation  in  an  implicit  asso¬ 
ciation  test  (17). 

OT  enhanced  the  frequency  of  prosocial  decisions  in  the  absence 
of  opportunity  for  direct  self  reward,  but  provoked  an  increase  in 
selfish  decisions  when  choosing  between  self  and  other.  Such  a  dual 
function  has  also  been  reported  in  humans.  OT  can  both  promote 
cooperation  and  increase  out-group  bias  depending  on  behavioral 
context  (16,  17,  39).  Thus,  OT  does  not  appear  to  have  a  universal 
prosocial  influence  on  behavior,  but  rather  amplifies  ongoing  social 
information  processing  (21),  perhaps  by  influencing  already  existing 
preferences.  It  is  plausible  that  OT  mediates  prosociality  and 
generosity  only  in  an  indirect  manner.  Alternatively,  OT  may  play 
a  more  direct  and  causal  role  in  modulating  context-dependent 
social  information  processing  (e.g.,  refs.  27-29  for  neural  evidence), 
specifically  by  enhancing  the  gain  of  neural  circuits  mediating 
vicarious  reinforcement  and  attention. 

Recently,  OT  has  been  evaluated  for  potential  therapeutic  use  in 
clinical  conditions  attended  by  dysfunctional  social  behavior,  such 
as  autism  spectrum  disorders,  antisocial  personality  disorder,  and 
schizophrenia  (20-24,  43,  44).  Notably,  the  intranasal  nebulization 
method  (35)  we  developed  here  is  well-tolerated  by  children  for 
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delivery  of  other  therapeutics  (i.e.,  albuterol),  thus  opening  up 
avenues  for  early  OT  intervention  in  neuropsychiatric  conditions 
with  social  deficits.  Furthermore,  choice-specific  effect  of  OT  on 
increasing  other-oriented  attention  suggests  a  potential  need  for 
active  decision-making  during  OT  interventions. 

The  current  finding  opens  up  new  opportunities  for  uncover¬ 
ing  the  mechanisms  underlying  the  influences  of  OT  on  social 
behavior  in  a  species  much  more  closely  related  to  humans  than 
rodents.  Rhesus  monkeys  have  long  served  as  the  primary  model 
species  for  probing  the  neural  mechanisms  mediating  high-level 
cognition.  Given  the  strong  similarities  in  social  behavior  and 
cognition,  and  the  apparent  homologies  in  underlying  neural 
circuitry,  the  rhesus  macaque  provides  a  powerful  model  for 
probing  the  mechanisms  mediating  some  of  the  basic  behaviors 
that  make  complex  human  social  interactions  possible. 

Methods 

General  Procedures  and  Behavioral  Task.  All  procedures  were  approved  by  the 
Duke  University  Institutional  Animal  Care  and  Use  Committee.  Two  donor 
monkeys  (MY  and  MO)  and  a  recipient  monkey  (MD)  participated  in  the 
study.  All  animals  underwent  standard  surgical  procedures  for  implanting 
a  head-restraint  prosthesis  at  least  6  mo  before  the  present  study.  The  head- 
restraint  prosthesis  allowed  us  to  monitor  eye  position,  sampled  at  1,000  Hz 
(SR  Research;  Eyelink),  as  well  as  conduct  single-unit  recordings  in  other 
experiments,  not  reported  here.  Both  the  donor  and  recipient  were  head- 
restrained  throughout  the  experiment.  Donors  and  recipient  were  unrelated, 
middle-ranked,  and  not  cage  mates.  Face  of  recipient  (other;  corresponding 
horizontal  and  vertical  eye  positions)  was  empirically  mapped.  Rewards  were 
0.5-1 .0  mL  of  cherry-flavored  juice.  Within  each  block,  reward  size  was 
constant  for  all  three  outcomes.  A  separate  solenoid  was  designated  for 
rewarding  neither  that  only  produced  clicks  but  delivered  no  fluid.  To  pre¬ 
vent  monkeys  from  forming  secondary  associations  between  solenoid  clicks 
and  different  reward  types,  all  solenoid  valves  (including  the  one  used  to 
deliver  "neither"  reward)  used  to  deliver  juice  rewards  were  placed  in  an¬ 
other  room.  Masking  white  noise  was  also  played  in  the  experimental  room. 

Donors  began  the  trial  by  shifting  gaze  (±  2.5°)  to  a  central  stimulus  (0.5°  x 
0.5°),  and  maintained  fixation  (for  200  ms).  Choice  and  cue  trials  were  pre¬ 
sented  at  equal  frequencies  and  randomly  interleaved.  On  choice  trials  (Fig. 
IB),  two  visual  targets  (4°  x  4°)  appeared  at  two  random  locations  of  7° 
eccentricity  and  reflected  about  the  vertical  meridian.  Donors  shifted  their 
gaze  to  one  target  (±  2.5°)  to  indicate  their  choice.  On  cued  trials  (Fig.  IB), 
donors  maintained  fixation  while  a  cue  appeared  centrally  (for  500  ms).  On 
both  trial  types,  the  reward  onset  was  followed  by  a  0  to  0.9  s  delay.  Donors 
could  freely  look  around  for  0-0.9  s  following  making  a  choice  and  for  an¬ 
other  1  s  after  the  reward  onset.  Data  from  error  trials  are  not  included 
in  analyses. 

Data  from  12  OT  (MY:  5,  MO:  7)  and  10  saline  (MY:  3,  MO:  7)  sessions  were 
collected  on  strictly  alternating  days.  Each  day  session  was,  on  average,  1,274  ± 
141  (mean  ±  SEM)  trials.  Within  each  day  session,  several  blocks  of  the  task  (a 
median  of  6  and  6.5  blocks  for  OT  and  saline,  respectively)  were  completed  by 
the  donors.  Each  of  these  blocks  typically  consisted  of  192  ±  10  (mean  ±  SEM) 
and  205  ±  15  trials  for  OT  and  saline,  respectively. 

Intranasal  OT  Protocol.  Donor  monkeys  were  transported  in  the  primate  chair 
from  the  colony  room  to  the  experimental  room.  After  stabilizing  their  heads, 
OT  (25  lU/mL;  Agrilabs)  was  delivered  via  nebulization  (Pari  Baby  Nebulizer) 
into  the  nose  and  mouth  continuously  for  5  min  (5  lU/min)  when  the  donor 
monkeys  were  fully  awake.  On  alternating  days,  nebulized  saline  served  as 
a  control.  Before  experimental  sessions,  donor  monkeys  were  first  habituated 
to  the  nebulizer  and  then  accustomed  to  saline  delivery  using  the  nebulizer  in 
an  incremental  fashion  until  they  were  completely  relaxed  during  the  pro¬ 
cedure,  which  typically  took  about  a  week.  In  fact,  donor  monkeys  showed  no 
distress  during  this  procedure.  Testing  began  exactly  30  min  after  each  treat- 
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ment,  at  which  time  a  recipient  monkey  was  brought  to  the  experimental 
setup.  In  the  guinea  pig  CNS,  radioactively  labeled  OT  lasts  up  to  4  h  (45).  In 
humans,  intranasal  delivery  of  a  similar  peptide,  vasopressin  (differing  by  only 
two  amino  acids),  increases  its  concentration  in  the  CSF  after  10  min,  and  el¬ 
evated  vasopressin  levels  are  maintained  for  more  than  80  min  after  admin¬ 
istration  (35).  In  that  study  (35),  vasopressin  levels  increased  significantly  after 
30  min.  Previous  studies  in  humans  have  not  measured  inhaled  OT  uptake  into 
the  CNS.  Fig.  ID  plots  CSF  OT  levels  in  monkeys  35  min  after  inhalation, 
demonstrating  efficacy  of  the  intranasal  nebulization  method  (see  below). 
Note  that  the  mask  was  always  pressed  very  tightly  to  minimize  potential 
leakage,  but  nonetheless  leakage  could  have  occurred.  It  is  worth  noting  that 
CSF  OT  levels  may  have  continued  to  increase  after  the  time  of  CSF  mea¬ 
surement,  warranting  caution  in  linking  absolute  CSF  OT  levels  with  changes 
in  behavior.  Despite  these  uncertainties,  our  nebulization  technique  resulted 
in  a  ~2. 5-fold  increase  in  CSF  OT  levels  roughly  0.5  h  after  inhalation. 

CSF  OT  Protocol.  To  determine  whether  inhaled  OT  penetrates  the  CNS  after 
nebulization,  OT  concentration  in  CSF  was  measured  via  cervical  punctures 
(on  average  35  min  after  the  beginning  of  inhalation).  Cervical  punctures 
were  performed  by  a  licensed  veterinarian,  and  targeted  the  cisterna  magna 
through  the  juncture  between  the  occipital  base  and  atlas  (Cl)  through  the 
atlanto-occipital  membrane.  Monkeys  were  first  anesthetized  with  ketamine 
(3  mg/kg,  i.m.)  and  dexdomitor  (0.075  mg/kg,  i.m.).  To  reverse  anesthesia,  we 
administered  antisedan  (0.075  mg/kg,  i.m.)  once  the  animal  was  returned  to 
its  cage  after  the  draw.  Approximately  0.5  mL  of  CSF  was  drawn  using  a  24  to 
27  gauge  needle.  At  the  performing  veterinarian's  discretion,  bupivacaine 
was  administered  subcutaneously  at  the  insertion  site  following  needle  re¬ 
moval.  CSF  was  immediately  frozen  on  dry  ice  and  sent  off-site  to  be  assayed 
for  OT  (Biomarkers  Core  Labs,  Yerkes  National  Primate  Research  Center, 
Atlanta,  GA)  using  a  commercially  prepared  kit  [Assay  Designs  (now  Enzo 
Life  Sciences);  cat.  #  900-153:  Oxytocin  ELISA  kit,  with  very  low  reactivity 
with  vasopressin].  Samples  were  assayed  "neat"  with  a  range  of  15.6-1,000 
|iL  assay  volume.  This  assay  has  near-zero  reactivity  with  vasopressin,  which  is 
chemically  similar  to  OT,  thus  providing  specific  quantitation  of  OT. 

Data  Analysis.  Preference  index  was  a  contrast  ratio  of  frequency  of  choosing 
an  option,  nA  or  nB: 

Preference  Index  =  — — — . 

nA+nB 

For  choices  between  self  vs.  other,  nA  and  nB  were  number  of  choices  to 
reward  other  and  self,  respectively.  For  choices  between  other  vs.  neither,  nA 
and  nB  were  number  of  choices  to  reward  other  and  neither,  respectively. 
Finally,  for  choices  between  self  vs.  neither,  nA  and  nB  were  number  of 
choices  to  reward  neither  and  self,  respectively.  Indices  ranged  from  -1  to  1, 
with  1  corresponding  to  always  choosing  the  "prosocial"  option  to  reward 
the  recipient  monkey  (when  that  was  an  option)  or  to  withhold  reward  from 
self  (self  vs.  neither).  An  index  of  -1  indicated  that  donors  always  chose  an 
"antisocial"  option  to  reward  self  (when  that  was  an  option)  or  to  withhold 
reward  from  the  other  monkey  (other  vs.  neither).  Preference  index  of  0  in¬ 
dicated  indifference.  Frequency  of  donors  looking  at  recipients  was  com¬ 
puted  from  number  of  gaze  shifts  to  the  recipient's  facial  region  (within  ± 
8.5°  spanning  from  the  center  of  the  recipient's  face).  Reaction  times  (time 
from  target  onset  to  movement  onset)  were  computed  using  a  20°/s  velocity 
threshold  (46). 
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Mechanistic  Classification  of  Neural  Circuit 
Dysfunctions:  Insights  from  Neuroeconomics  Research 
in  Animals 

Steve  W.C.  Chang,  David  L.  Barack,  and  Michael  L.  Platt 

Many  psychiatric  conditions  present  complex  behavioral  symptoms,  and  the  type  and  magnitude  of  underlying  neural  dysfunction  may  vary 
drastically.  This  review  introduces  a  classification  scheme  for  psychiatric  symptoms,  describing  them  in  terms  of  the  state  of  a  dysfunctional 
neural  circuit.  We  provide  examples  of  two  kinds  of  functional  deficits:  variance-shifted  functionality,  in  which  a  damaged  circuit  continues 
to  function  albeit  suboptimally,  and  state-shifted  functionality,  resulting  in  an  absent  or  qualitatively  different  functional  state.  We  discuss, 
from  the  perspective  of  neuroeconomics  and  related  areas  of  behavioral  investigation,  three  broad  classes  of  commonly  occurring 
symptoms  in  psychopathology  based  on  selected  studies  of  decision  making  in  animals:  temporal  discounting,  social  preferences,  and 
decision  making  under  environmental  volatility.  We  conclude  that  the  proposed  mechanistic  categorization  scheme  offers  promise  for 
understanding  neural  circuit  dysfunctions  underlying  psychopathology. 


Key  Words:  Animals,  decision,  electronic  circuit,  neuroeconomics, 
psychopathology,  reward,  state-shifted,  suboptimal,  variance- 
shifted 


Comprised  of  constellations  of  behavioral  symptoms,  psychi¬ 
atric  disorders  frequently  frustrate  any  simple  attempt  to 
translate  observed  phenotype  into  neurobiological  mecha¬ 
nism.  Even  at  the  individual  symptom  level,  such  translation  is 
challenging  and  not  easily  quantifiable.  Behavioral  symptoms  are 
often  compound  and  thus  difficult  to  interpret.  This  presents  a 
challenge  for  understanding  their  core  neurobiological  features, 
creating  practical  barriers  to  designing  behavioral  or  diagnostic 
tests.  This  difficulty  may  be  amplified  when  studying  the  illnesses 
manifested  as  a  result  of  dysfunctions  in  the  prefrontal,  limbic,  and 
para  limbic  regions,  which  are  less  well  understood,  compared  with, 
for  example,  the  occipital  cortex.  A  promising  alternative  to  under¬ 
standing  the  neurobiology  of  psychiatric  disorders  begins  by  clas¬ 
sifying  them  according  to  the  ways  the  underlying  mechanisms 
may  fail.  In  this  issue  exploring  the  benefits  of  a  neuroeconomics 
approach  for  understanding  psychopathology,  we  outline  a  mech¬ 
anistic  classification  scheme  grounded  in  the  principles  of  neuro- 
economic  studies  of  cognition  and  behavior  in  animals. 

Variance-Shifted  Versus  State-Shifted  Functionality: 
Insights  from  Electronics 

Dysfunctional  neural  circuitry  can  be  functionally  classified  into 
two  different  states  based  on  the  outputs  of  disrupted  circuits.  As 
an  illustration,  consider  an  electronic  circuit  designed  to  produce  a 
specific  output.  A  variance-shifted  circuit  operates  with  added 
noise  and,  therefore,  generates  a  broadened  output  distribution, 
resulting  in  suboptimal  performance.  However,  a  suboptimal  cir¬ 
cuit  may  continue  to  process  information  (1).  By  contrast,  a  state- 
shifted  circuit  may  generate  a  completely  different  functional  out- 
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put,  either  beyond  the  expectation  of  a  downstream  circuit  or 
failing  to  generate  any  output  at  all,  producing  a  qualitatively  dif¬ 
ferent  or  absent  output  and  resulting  in  behavior  drawn  from  a 
different  distribution  altogether  (1 ). 

As  a  simplified  analogy,  a  simple  band-pass  filter  illustrates  the 
different  classes  of  damage-induced  functional  states.  A  change  in 
circuit  resistance  or  capacitance  will  change  the  effective  cutoff 
frequency,  while  a  short  in  the  system  effectively  halts  filtering  (1). 
Changes  in  a  circuit's  resistance  will  result  in  a  noisier  output,  anal¬ 
ogous  to  psychiatric  conditions  in  which  afflicted  individuals  show 
difficulty  in  evaluating  changes  in  the  environment.  Such  damage 
to  the  circuit  reveals  its  critical  role  for  producing  adaptive,  normal 
behavior.  In  contrast,  the  presence  of  a  short  in  the  system  will 
prevent  filtering  of  relevant  information,  analogous  to  situations 
where  afflicted  individuals  completely  lose  sensitivity  to  changes  in 
the  environment.  In  this  case,  the  state-shifted  circuit  reveals  its 
necessary  role  in  the  production  of  a  particular  behavior. 

The  intricate  balance  between  circuit  components  can  result  in 
functional  changes  that  are  either  large  and  noticeable  or  small  and 
subtle.  Some  neuropsychiatric  symptoms  only  differ  from  others 
slightly,  whereas  others  are  so  specific  to  a  condition  that  they  serve 
as  a  diagnostic  hallmark.  Furthermore,  because  of  the  complex  and 
multilayered  nature  of  neural  circuits,  initial  perturbations  may  re¬ 
sult  at  first  in  a  state-shifted  circuit  that,  due  to  neural  plasticity, 
resolves  back  to  a  variance-shifted,  or  even  fully  restored,  state.  In 
summary,  psychiatric  symptoms  may  result  from  a  relatively  pre¬ 
served  neural  circuit  operating  with  added  noise,  producing  devi¬ 
ant  and  suboptimal  behavior  (variance-shifted  functionality).  Alter¬ 
natively,  it  may  arise  from  a  shorted  circuit  producing  completely 
different  or  absent  behaviors  (state-shifted  functionality). 

The  two  damaged  states  can  be  described  in  terms  of  neural 
network  models  as  well.  In  a  trained  neural  network,  the  organiza¬ 
tional  principles  involve  individual  computational  units,  or  nodes, 
whose  functionalities  may  be  obscure  and  may  encode  information 
idiosyncratically  (2,3).  A  variance-shifted  functional  state  may  result 
from  damage  to  peripheral  nodes,  whereas  a  state-shifted  state 
may  be  induced  by  damage  to  a  central  node  in  the  network.  The 
two  functionalities  can  also  be  described  based  on  the  output  sta¬ 
tistics  of  an  implicated  circuit.  A  variance-shifted  dysfunction  in  a 
neural  circuit  may  produce  circuit  (or  behavioral)  outputs  charac¬ 
terized  by  a  broadened  and/or  attenuated  distribution  compared 
with  optimal  functionality  (thus  less  specific  or  more  noisy).  In  con¬ 
trast,  a  state-shifted  dysfunction  in  a  circuit  may  produce  an  output 
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drawn  from  a  completely  different  distribution  (thus  qualitatively 
different)  or  may  result  in  a  complete  failure  to  produce  any  output. 
It  is  worthwhile  to  note  that  a  state  shift  could  occur  in  the  direction 
of  extreme  enhancement,  resulting  in  exaggerated  behavior  such 
as  positive  symptoms  in  schizophrenia. 

Our  classification  scheme,  though  neither  exceptionless  nor  ex¬ 
haustive,  provides  insight  into  the  possible  mechanisms  underlying 
psychiatric  symptoms.  The  two  deficit  types  may  occur  simultane¬ 
ously  or  sequentially  (and  the  distinction  sometimes  can  be  ambig¬ 
uous  until  a  given  circuit  is  fully  understood)  but  may  provide  novel 
mechanistic  insights  into  psychopathology  and  inform  the  relation¬ 
ship  of  pathology  to  health.  This  approach  differs  fundamentally 
from  the  Diagnostic  and  Statistical  Manual  of  Mental  Disorders,  the 
International  Classification  of  Diseases,  and  the  like,  which  are  de¬ 
signed  to  describe  a  disorder  using  a  list  of  behavioral  symptoms  for 
diagnostic  purposes.  The  present  scheme  is  useful  for  directly  com¬ 
paring  the  functionality  of  neural  mechanisms  and  their  corre¬ 
sponding  behaviors  across  normal  and  dysfunctional  states  of  the 
brain.  A  successful  distinction  between  variance-  and  state-shifted 
dysfunction  is  constrained  by  our  understanding  of  a  given  circuit. 
For  example,  a  variance-shifted  dysfunction  under  one  functional 
criterion  could  be  seen  as  a  state-shifted  condition  under  a  different 
framework.  Such  ambiguity,  which  is  present  in  any  classification 
scheme,  can  only  be  resolved  through  more  comprehensive  under¬ 
standing  of  a  circuit. 

Examples  from  Oculomotor  System 

Examples  from  oculomotor  system  help  illustrate  the  two  dis¬ 
tinct  dysfunctional  states  described  above.  The  superior  colliculus 
and  frontal  eye  fields  belong  to  a  distributed  oculomotor  circuit 
spanning  cortical  and  subcortical  structures  (4,5).  Frontal  eye  field 
lesions  increase  variability  in  saccade  trajectories  and  severely  dis¬ 
rupt  selection  of  targets  in  the  contralesional  hemifield  (6).  Frontal 
eye  field  lesioned  animals,  however,  can  still  saccade  (6).  By  con¬ 
trast,  superior  colliculus  lesions  temporarily  abolish  contralesional 
saccades  altogether  (7).  They  also  permanently  increase  saccade 
latencies  and  eliminate  the  animal's  ability  to  make  express  sac- 
cades  (saccades  with  reaction  times  less  than  1 00  msec  in  monkeys) 
in  a  gap  task  (7),  designed  to  bypass  the  time  required  to  disengage 
from  visual  fixation  by  inserting  a  gap  between  the  offset  of  a 
fixation  stimulus  and  target  onset  (8).  Therefore,  for  saccades,  fron¬ 
tal  eye  field  disruption  results  in  noisy  (i.e.,  variable)  performance 
but  preserves  overall  functionality,  a  variance-shifted  dysfunction. 
Superior  colliculus  damage  alone,  by  contrast,  is  sufficient  to  tem¬ 
porarily  abolish  saccades,  which  is  consistent  with  a  state-shifted 
dysfunction.  These  examples  demonstrate  that  distinct  mechanis¬ 
tic  deficits  can  impair  or  abolish  normal  function. 

Neuroeconomics  of  Decision  Making  in  Animals 

Neuroeconomics,  a  discipline  that  marries  the  mathematical 
formalisms  of  classical  economics,  the  psychophysical  methods  of 
behavioral  economics,  and  contemporary  neurosciences  (9-11), 
provides  an  illuminating  test  of  the  functionality-based  classifica¬ 
tion  scheme  for  defining  mechanistic  pathologies  in  decision  mak¬ 
ing  (fora  review  regarding  the  benefits  of  animal  models  in  neuro¬ 
economics,  see  [12]).  The  approach  applies  mathematically 
tractable  economic  formalizations  to  the  nervous  system  and  fo¬ 
cuses  on  basic  economic  concepts  such  as  utility  (9,13-15),  risk 
(16,17),  and  temporal  discounting  (18,19),  providing  quantitative 
frameworks  for  examining  the  neural  mechanisms  underlying  cog¬ 
nitive  processes  (12). 


The  neuroeconomic  framework  in  animal  models  is  advanta¬ 
geous  for  studying  complex  forms  of  decision  making  by  tapping 
into  their  innate  reward-seeking  behaviors  while  maintaining  etho- 
logical  validity.  Unlike  in  humans,  animal  models  offer  access  to 
studying  complex  behaviors  at  the  resolution  of  single  neurons. 
Further,  insights  into  different  types  of  mechanistic  deficits  in  neu¬ 
ropsychiatric  symptoms  can  be  obtained  by  studying  decisions 
animals  make  following  perturbation  of  neural  circuits.  Thus,  ani¬ 
mal  models  of  decision  making  provide  valuable  insights  into  char¬ 
acterizing  the  biological  mechanisms  of  behavior,  detailing  the 
formal  operations  the  brain  performs  in  realizing  different  cognitive 
capacities. 

We  discuss  a  selection  of  experiments,  categorizing  the  ob¬ 
served  deficits  as  the  variance-shifted  and  state-shifted  model  of 
neural  circuit  dysfunctions.  We  organize  this  discussion  around 
three  examples  of  circuit  dysfunction  in  light  of  neuroeconomics 
and  other  related  disciplines:  disorders  of  temporal  discounting  in 
addiction,  social  and  other-regarding  preferences  (ORP),  and  deci¬ 
sion  making  under  environmental  volatility.  Our  intention  is  not  to 
establish  necessary  and  sufficient  conditions  for  connecting  a  spe¬ 
cific  dysfunction  and  a  specific  neural  circuit.  Doing  so  would  not  be 
practically  possible.  Instead,  in  this  exercise,  we  attempt  to  label 
experimentally  induced  behavioral  deficits  observed  in  animals  as 
dysfunctions  arising  from  either  a  variance-  or  state-shifted  func¬ 
tional  state  in  the  implicated  circuit.  Although  this  classification 
scheme  can  be  just  as  easily  applied  to  any  perturbation  results 
(e.g.,  microstimulation  or  drug  infusion),  we  focus  on  lesion  studies 
for  their  blunt  effectiveness  in  perturbing  circuit  function. 

Addiction  as  a  Disorder  of  Temporal  Discounting 

Single-unit  recordings  in  animals,  as  well  as  neuroimaging  in 
humans,  have  found  that  striatal  dopaminergic  signaling  is  critical 
for  reward-related  processing,  including  motivation  and  learning 
(20-22),  and  that  dysfunctional  dopaminergic  signaling  disrupts 
reward  anticipation  in  drug  addiction  (for  a  review,  see  [23-25]). 
Firing  rates  of  midbrain  dopamine  neurons  compute  economic 
decision  parameters,  such  as  reward  probability,  reward  delay,  and 
reward  uncertainty  (26-28).  Dopaminergic  signaling  is  also  in¬ 
volved  in  evaluating  the  economic  costs  and  benefits  of  upcoming 
rewards.  For  example,  neurons  in  rodent  nucleus  accumbens  (NAc) 
encode  anticipated  reward  benefits,  without  encoding  response 
costs  to  achieve  the  reward  (28).  Such  economic  computations  by 
the  mesolimbic  dopamine  system  may  contribute  to  addiction  and 
other  motivation-related  disorders. 

Temporal  discounting  describes  a  time-dependent  devaluation 
of  economic  value  (18).  It  is  a  phenomenon  observed  across  multi¬ 
ple  species  including  rodents,  monkeys,  and  humans  (18,29,30). 
When  provided  an  option  to  choose  an  immediate  but  smaller 
reward  over  a  larger  reward  with  a  longer  delay,  animals  reliably 
prefer  the  immediate  option  (31).  Addicted  individuals  discount 
more  than  nonaddicted  individuals  (24,32),  as  evidenced  by  behav¬ 
iors  manifested  in  addiction  to  cocaine,  alcohol,  opioid,  nicotine, 
and  gambling  (for  a  review,  see  [32]).  Therefore,  a  disruption  in 
temporal  discounting  may  be  a  common  mechanistic  deficit  shared 
by  many  classes  of  addiction. 

Single-unit  recordings  in  monkeys  demonstrate  that  neurons  in 
the  striatum  mediate  computations  underlying  temporal  discount¬ 
ing  (33).  Rats  with  NAc  lesions  display  severe  difficulty  in  choosing  a 
delayed  reward  option  in  an  intertemporal  choice  task,  suggesting 
a  critical  role  of  NAc  in  computing  economic  values  of  rewards  in 
time  (34).  Further,  NAc  lesions  do  not  abolish  reward  sensitivity 
altogether  but  impair  the  implementation  of  an  optimal  (reward- 
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maximizing)  strategy  (35),  as  if  these  animals  cannot  accurately 
compute  temporally  discounted  utility  to  guide  decisions.  Similarly, 
addicted  individuals  rarely  lose  the  ability  to  seek  addicted  sub¬ 
stances.  Rather,  they  display  impaired  impulsive  control  in  pursuing 
immediate  rewards,  consistent  with  atypical  temporal  discounting. 
Thus,  addiction-related  deficits  resemble  a  variance-shifted  func¬ 
tionality,  resulting  in  disrupted  decisions  in  time,  though  retaining 
some  sensitivity  to  reward  (i.e.,  performance  does  not  become  ran¬ 
dom  and  the  discounting  function  does  not  become  flat).  Deficits 
resulting  from  perturbations  to  dopamine  circuits  performing  eco¬ 
nomic  calculations  seem  to  cause  noisy  mappings,  or  variance  shifts 
in  the  representations,  among  reward,  action,  and  time. 

Neural  correlates  of  temporal  discounting  are  also  found  in  the 
prefrontal  cortex  (for  a  review,  see  [36]).  Neurons  in  dorsolateral 
prefrontal  cortex  (dIPFC)  encode  the  temporally  discounted  value 
of  upcoming  rewards  (19).  A  cocaine  self-administration  study  in 
monkeys  found  that  activity  in  the  anterior  cingulate  cortex  (ACC)  is 
enhanced  upon  cocaine  intake  (37),  consistent  with  human  neuro¬ 
imaging  studies  showing  that  drug  seeking  in  addiction  is  linked  to 
the  prefrontal  cortex  (38,39).  ACC  involvement  in  reward-guided 
decision  making  is  not  limited  to  processing  directly  experienced 
outcomes  but  also  includes  Active  outcomes  (40),  similar  to  the 
human  ventral  striatum  (41).  Correctly  utilizing  such  Active  signals 
may  be  critical  in  addiction.  Individuals  with  chronic  nicotine  addic¬ 
tion  fail  to  utilize  these  signals  to  adjust  their  choices  in  an  invest¬ 
ment  task  (42).  Furthermore,  gambling  addiction  seems  to  require 
rewards  that  are  delivered  according  to  a  partial  or  a  variable  sched¬ 
ule  (43),  coupled  with  near-miss  Active  reward  signals. 

Disorders  of  Social  and  Other-Regarding  Preferences 

Precisely  how  social  information  is  integrated  into  economic 
decisions  in  neural  circuits  remains  obscure.  Understanding 
whether  social  disorders  are  manifested  by  a  deAcit  in  a  decision 
circuit  or  a  circuit  purely  involved  in  evaluating  social  information 
from  the  environment  remains  a  challenge.  ORPs  describe  a  consid¬ 
eration  for  the  economic  well-being  of  others.  ORP  computations 
may  reAect  a  stage  where  decision  making  and  social  information 
processing  are  partially  integrated.  Consider  autism  spectrum  dis¬ 
order  (ASD),  which  handicaps  social  and  communicative  abilities  of 
~1  per  110  children  in  the  United  States  (44).  Autism  spectrum 
disorder  individuals  show  little  interest  in  others  (45).  This  lack  of 
interest  is  associated  with  other  complex  social  deAcits,  including 
reduced  empathy  and  joint  attention,  further  disrupting  the  capac¬ 
ity  for  normal  social  interactions  (46,47).  Differences  between  ASD 
and  typically  developing  individuals  are  illustrated  by  performance 
in  economic  bargaining  games  designed  to  elicit  ORP.  While 
healthy  individuals  readily  engage  in  reciprocal  cooperation  in 
these  games,  ASD  individuals  adopt  simple  rules  that  are  both  less 
Aexible  and  more  laboriously  employed  (48).  It  remains  unclear 
whether  circuit  dysfunctions  in  ASD  more  closely  resemble  vari¬ 
ance-shifted  or  state-shifted  states.  Comparison  with  other  disor¬ 
ders  marked  by  social  deAcits,  such  as  schizophrenia,  psychopathy, 
and  eating  disorders,  may  help  to  illuminate  the  underlying  pathol¬ 
ogy  in  ASD. 

ACC  is  critical  for  social  processing.  ACC  gyrus  lesions  in  mon¬ 
keys  abolish  the  animal's  ability  to  evaluate  social  information,  as 
measured  by  response  latencies  to  retrieve  food  in  the  presence  of 
socially  arousing  images,  such  as  staring  monkeys  (49).  Although 
the  changes  in  response  latencies  in  ACC-lesioned  animals  can 
differ  substantially  depending  on  the  types  of  social  stimuli  and 
often  on  the  individuals,  sensitivity  to  social  stimuli  can  be  elimi¬ 
nated  by  the  lesion  (49).  This  social  evaluation  deAcit  therefore 


resembles  a  state-shifted  functionality,  in  which  social  evaluation 
processing  is  no  longer  intact.  In  contrast,  ACC  sulcus  and  orbito- 
frontal  cortex  (OFC)  lesions  produce  deviant  behaviors  but  fail  to 
abolish  the  sensitivity  to  social  stimuli  (49),  resembling  a  noisy 
suboptimal  state  and  a  variance-shifted  functionality. 

Closely  related  to  ORP,  empathy-related  processing  by  ACC  has 
been  investigated  in  the  context  of  perceiving  painful  events  of 
others.  The  brain  areas  involved  in  pain  perception  in  humans, 
namely  ACC  and  frontal  insula,  are  more  metabolically  active  when 
perceiving  a  painful  stimulus  delivered  to  fair  compared  with  unfair 
players  in  an  economic  game  (50).  In  rodents,  ACC,  along  with  other 
medial  pain  systems,  mediates  observational  fear  conditioning 
while  watching  a  conspeciAc  receive  a  shock  (51).  Both  lidocaine- 
induced  inactivation  and  targeted  deletion  of  a  voltage-gated  cal¬ 
cium  channel  in  ACC  can  substantially  reduce  observational  fear 
conditioning  but  not  eliminate  it  (51).  A  dysfunction  in  empathy- 
related  processing  in  ACC  might  be  driven  by  variance-shifted  dys¬ 
functional  states,  resulting  in  degraded  sensitivities  to  process  or 
simulate  the  painful  events  of  others. 

A  link  between  ORP  and  emotional  processing  remains  elusive. 
Amygdala  is  one  of  the  primary  structures  linked  to  emotional 
processing  and  is  reciprocally  connected  to  ACC  and  OFC  (52,53). 
Amygdala  dysfunction  is  related  to  a  number  of  psychiatric  symp¬ 
toms,  including  major  depression  and  bipolar  disorder  and  affec¬ 
tive  psychosis  in  schizophrenia  (54).  Typically,  amygdala  contribu¬ 
tion  to  emotional  processing  has  been  investigated  using  fear- 
inducing  or  social  stimuli.  Monkeys  with  bilateral  amygdala  lesions 
show  abolished  fear  responses,  as  measured  by  response  latencies 
to  retrieve  food  in  the  presence  of  a  fearful  stimulus  (52,55).  Consis¬ 
tent  with  these  observations,  amygdala-lesioned  rats  completely 
lose  the  ability  to  acquire  conditioned  fear,  even  when  the  lesion 
occurs  a  month  after  the  initial  Pavlovian  training,  suggesting  a 
necessary  role  in  emotional  memory  (56,57)  (i.e.,  state-shifted  due 
to  an  absent  distribution).  Notably,  in  many  psychiatric  conditions 
involving  emotion,  the  gain  on  emotional  processing  in  amygdala 
might  be  set  too  high,  possibly  due  to  impaired  communication 
with  other  structures,  such  as  prefrontal  cortex,  that  modulate 
amygdala  activity  (58).  Such  unregulated  emotional  processing 
might  lead  to  exaggerated  behavior,  presumably  due  to  a  state 
shift.  For  example,  this  state  shift  might  result  in  a  more  responsive 
and  less  regulated  state.  The  reciprocal  information  transmission 
among  the  amygdala,  ACC,  and  OFC  (52,53)  suggests  that  the  emo¬ 
tional  component  of  ORP  may  originate  from  the  amygdala. 

Disorders  of  Decision  Making  Under  Environmental 
Volatility 

Several  neurological  and  psychiatric  disorders  compromise  the 
adaptive  abilities  of  cognitive  systems,  whether  updating  the  ex¬ 
pected  values  of  targets  according  to  task  demands  or  appropri¬ 
ately  reorienting  to  reAect  changes  in  the  environment.  Notably, 
some  cognitive  deAcits  such  as  an  inAexibility  to  adapt  to  environ¬ 
mental  changes  are  shared  across  multiple  neurological  and  psy¬ 
chiatric  conditions.  For  example,  degeneration  of  mechanisms  that 
contribute  to  adaptive  decision  making,  including  task  set  switch¬ 
ing,  task  set  maintenance,  and  inhibitory  control,  characterizes  cog¬ 
nitive  and  executive  deAcits  in  schizophrenia  (59,60).  From  a  neu- 
roeconomic  perspective,  these  may  emerge  from  failures  in 
updating  reward  valuation,  risk,  and  volatility.  In  the  Wisconsin  Card 
Sorting  Task  (WCST),  typically  used  to  probe  the  ability  to  adjust  to 
changing  environments  without  explicit  cues,  participants  sort  a 
deck  of  cards  according  to  unpredictably  changing  rules  (61).  Dur¬ 
ing  the  task,  schizophrenic  patients  perseverate  more  on  choosing 
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incorrect  responses,  persisting  longer  with  a  previous  rule  despite 
negative  feedback  (62).  These  individuals  also  show  increased  re¬ 
sponse  times  and  make  more  errors  in  the  Stroop  task  (63-67). 

Schizophrenia  is  accompanied  by  both  negative  symptoms, 
such  as  lack  of  emotion,  and  positive  symptoms,  such  as  hallucina¬ 
tions  and  delusions  (68).  In  addition,  schizophrenia  is  associated 
with  deficits  in  executive  and  cognitive  functions  (68).  Such  deficits 
include  inflexible  adjustments  in  behavioral  strategies,  or  policies, 
that  require  computing  expected  value  of  reinforcers  on  the  basis 
of  the  accumulation  of  evidence  over  time,  assessment  of  value  on 
the  basis  of  reinforcer  identity,  and  projecting  these  evaluations 
into  the  future  (69).  Schizophrenic  patients  also  show  decreased 
abilities  to  stay  on  task  (70,71).  Deficits  related  to  executive  control 
are  suggested  to  be  caused  by  noisy  dopaminergic  gating  of  pre¬ 
frontal  neurons  (70).  Symptoms  in  the  domain  of  executive  control 
may  thus  reflect  variance-shifted  processing.  Positive  and  negative 
symptoms,  on  the  other  hand,  are  associated  with  exaggerated 
(e.g.,  hallucinations)  and  abolished  (e.g.,  lack  of  emotion)  process¬ 
ing,  respectively,  and  thus  are  more  consistent  with  a  state-shifted 
condition. 

In  schizophrenia,  the  posterior  cingulate  cortex  (PCC)  is  associ¬ 
ated  with  increased  default  network  connectivity,  with  the  degree 
of  enhanced  connectivity  positively  correlating  with  the  severity  of 
psychopathology,  and  these  patients  show  increased  cannabinoid 
receptor  expression  (mediating  inhibitory  neurotransmitters  like 
gamma-aminobutyric  acid)  (72,73).  A  case  study  of  lesions  in  the 
human  PCC  found  an  inability  to  adapt  to  new  environments  (74). 
Consistent  with  this,  neuronal  activity  in  monkey  PCC  tracks  the 
level  of  risk  in  changing  environments  (17)  and  is  correlated  with 
setting  a  behavioral  strategy  to  explore  or  exploit  different  options 
(75,76).  Thus,  disruptions  to  PCC  seem  to  compromise  an  ability  to 
detect  and  incorporate  discontinuities  in  environmental  statistics 
such  as  changes  in  expected  value  and  risk.  It  remains  unclear 
whether  volatility-related  deficits  in  PCC  lesions  reflect  variance-  or 
state-shifted  functionalities. 

An  explicit  task-switching  paradigm,  in  which  a  correct  response 
on  a  given  trial  or  group  of  successive  trials  is  explicitly  cued,  is  often 
used  to  investigate  executive  control.  In  such  a  task,  neurons  in  ACC 
increase  responses  following  task  switches  (77),  suggesting  sensi¬ 
tivity  to  changes  in  reward  information  used  in  executive  control. 
Lesions  to  ACC  gyrus  increase  the  frequency  of  consecutive  errors, 
whereas  more  comprehensive  lesions  in  ACC  (gyrus  and  sulcus) 
result  in  slowed  response  times,  errors  in  switching,  and  greater 
overall  consecutive  errors  (78).  Critically,  although  ACC  lesions  in¬ 
crease  switch-related  errors,  monkeys  are  still  able  to  switch  tasks 
above  the  chance  level,  suggesting  the  mechanisms  responsible  for 
cognitive  flexibility  are  not  completely  abolished  (78).  These  results 
implicate  a  variance-shifted  deficit  inducing  suboptimality  in  the 
ability  to  adapt  to  changing  environments  by  explicit  changes  in 
the  expected  values  of  the  targets. 

Perseveration  of  maladaptive  behavior  is  one  of  the  most  strik¬ 
ing  features  of  prefrontal  lesions.  Such  deficits  are  apparent  in 
environments  without  explicit  rule-changing  cues.  In  WCST,  pa¬ 
tients  with  dIPFC  lesions  fail  to  switch  to  a  correct  response  and 
instead  perseverate  on  an  incorrect  response  (79).  Indeed,  schizo¬ 
phrenia  is  associated  with  inefficient  dIPFC  function,  particularly 
with  respect  to  working  memory  (80).  Activity  of  dIPFC  neurons  in 
monkeys  is  correlated  with  the  level  of  conflict  in  WCST  (81)  and 
different  strategies  employed  within  the  task  (82,83).  In  a  WCST 
analog,  lesions  to  monkey  OFC,  ACC,  or  dIPFC  in  and  around  the 
principal  sulcus  (but  not  superior  and  medial  to  the  sulcus)  all  result 
in  fewer  uncued  rule-guided  behavioral  shifts,  though  the  animals 
still  execute  switches,  indicating  variance-shifted,  as  opposed  to 


fully  state-shifted,  dysfunction  (84).  In  contrast,  dIPFC-lesioned  an¬ 
imals  no  longer  show  a  stereotypical  increase  in  response  times  as  a 
function  of  conflict,  an  abolition  of  conflict-induced  changes  in 
motor  responses  (81),  consistent  with  the  full  destruction  of  con¬ 
flict-detection  mechanisms  in  dIPFC  (state-shifted).  Conflict  detec¬ 
tion  and  resolution  in  these  tasks  may  map  onto  running  calcula¬ 
tions  of  instantaneous  utility  and  uncertainty,  though  this  remains  a 
topic  of  ongoing  debate.  By  perturbing  circuits  that  detect  conflict 
or  encode  strategy,  dIPFC  damage  leads  to  a  computational  defi¬ 
ciency  in  value  updating  for  flexible  environmental  adaptation. 

Conclusions 

We  are  just  beginning  to  understand  what  constitutes  a  psychi¬ 
atric  disease.  Neuroeconomic  studies  in  animals  provide  new  in¬ 
sights  into  the  affected  neural  circuits  (85).  Our  proposed  classifica¬ 
tion  scheme  establishes  a  new  framework  for  thinking  about 
psychiatric  disorders  formulated  in  the  language  of  neural  circuits. 
It  remains  to  be  seen  how  the  circuit-based  classification  could 
augment  the  existing  typological  schemes  to  help  assess  and  treat 
psychiatric  disorders.  As  a  first  step,  we  have  focused  on  deficits  tied 
to  specific  breakdowns  in  selected  neural  circuits.  Some  deficits  are 
shared  and  thus  might  appear  in  multiple  classically  defined  ill¬ 
nesses.  Our  interpretation  is  intended  to  point  out  that  what  super¬ 
ficially  might  appear  to  be  very  different  syndromes  may,  in  fact, 
share  common  disruptions  in  the  underlying  neural  circuitry. 

Psychopathological  symptoms  can  be  approached  based  on  the 
precise  type  of  deficits  induced  in  neural  circuits.  A  neural  circuit  will 
show  different  outputs  depending  on  the  affected  circuit  compo¬ 
nents.  A  noisy  state  broadens  the  width  of  the  output  distribution, 
leading  to  suboptimal  performance,  but  may  not  alter  the  basic 
functionality  of  a  given  circuit.  In  contrast,  a  circuit  could  break 
down  or  be  extensively  modified,  introducing  a  new  state  into  the 
system  with  abnormal  or  absent  functionalities  that  are  qualita¬ 
tively  different  from  the  norm. 

Most  psychiatric  disorders  present  compound  symptoms.  It  is 
not  surprising  then  that  a  single  psychiatric  illness  arises  from  a 
combination  of  variance-shifted  and  state-shifted  circuit  dysfunc¬ 
tions,  involving  multiple  brain  areas.  For  example,  under  a  connec- 
tionist  neural  network  framework,  variance-shifted  dysfunctions 
may  result  from  damages  to  peripheral  processing  nodes.  When 
the  most  critical  region  of  the  distributed  network  is  disrupted, 
however,  we  may  observe  a  fully  compromised,  state-shifted  dys¬ 
function  instead  (though  the  deficits  may  eventually  be  restored  by 
other  areas  in  the  network  on  a  longer  time  scale).  Note  that  there 
are  clear  cases  of  state-shifted  psychopathology  when  the  deficits 
are  not  due  to  targeted  traumatic  brain  injury.  For  exam  pie,  in  visual 
or  auditory  hallucinations,  commonly  found  with  severe  schizo¬ 
phrenia,  individuals  experience  percepts  in  the  absence  of  actual 
sensory  signals.  The  circuits  that  mediate  these  experiences  are 
clearly  behaving  very  differently  and  seem  likely  to  be  induced  by  a 
state-shifted  process. 

Our  circuit-based  scheme  may  be  relevant  for  the  ongoing  de¬ 
bate  in  psychiatry  over  the  need  for  incorporating  dimensional 
diagnosisto  traditional  categorical  diagnosis  (86-90). The  variance- 
and  state-shifted  models  effectively  redescribe  such  dimensional 
criteria  at  the  level  of  neural  circuits.  For  example,  the  severity  or 
idiosyncrasy  of  a  given  symptom  for  a  given  individual  could  be 
linked  to  either  the  degree  of  variance  shift  (e.g.,  the  magnitude  of 
change  in  the  variance  of  the  distribution)  or  the  degree  of  state 
shifts  (e.g.,  the  magnitude  of  mean  shifts  in  the  distribution)  in 
behavioral  or  cognitive  output  according  to  the  proposed  scheme. 
Translating  psychiatric  symptoms  into  dimensional  outcomes  of 
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neural  circuit  dysfunction  may  open  up  new  avenues  for  improved 
therapeutic  intervention. 

The  circuit-based  classification  does  not  describe  a  relationship 
between  implicated  circuits  and  psychiatric  disorder  types.  Our 
classification  scheme,  which  critically  depends  on  our  understand¬ 
ing  of  the  functionality  of  a  given  circuit,  is  not  intended  to  replace 
existing  typologies  of  psychopathology.  Rather,  it  describes  a 
mechanistic  relationship  between  implicated  circuits  and  behav¬ 
ioral  deficits  caused  by  failures  of  those  circuits.  In  our  view,  the 
current  scheme  can  provide  easily  quantifiable  grounds  for  hypoth¬ 
esis  testing  for  linking  a  circuit-level  dysfunction  and  an  afflicted 
behavior  (e.g.,  Supplement  1)  and  thus  may  provide  novel  insights 
into  the  mechanistic  dysfunctions  underlying  psychiatric  condi¬ 
tions. 
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Mechanistic  Classification  of  Neural  Circuit  Dysfunctions:  Insights  from 
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Supplemental  Information 

An  Example  of  Hypothesis  Testing  Under  the  Variance-  and  State-Shifted  Framework 

Lesions  to  the  dorsolateral  prefrontal  cortex  (dlPFC)  in  monkeys  impair  executive  control 
(1).  The  posterior  cingulate  cortex  (PCC)  has  been  implicated  in  tracking  volatility  in  the 
environment  (2).  Both  dlPFC  and  PCC  are  implicated  in  schizophrenia  (see  Main  Text). 
However,  it  remains  unknown  how  the  two  areas  interact  to  exert  flexible  cognitive  control. 
Recording  neuronal  activity  from  PCC  after  a  dlPFC  lesion  could  provide  a  unique  opportunity 
to  test  how  the  executive  control  impairments  due  to  a  dlPFC  lesion  affect  volatility-tracking 
signals  in  PCC.  More  precisely,  comparing  the  response  profiles  of  PCC  neurons  before  and  after 
the  dlPFC  lesion  could  reveal  whether  the  prefrontal  lesion  induces  either  variance-shifted  or 
state-shifted  dysfunctions  in  PCC  neurons.  Similarly,  a  neuroimaging  experiment  could  test,  in 
individuals  with  schizophrenia  who  show  abnormal  dlPFC  and  PCC  metabolic  activity,  whether 
and  how  (e.g.,  variance-  or  state-shifted)  the  activations  in  dlPFC  and  PCC  are  functionally 
linked.  Results  from  studies  like  these  can  help  reveal  novel  insights  into  how  certain  circuits 
malfunction  during  specific  behaviors  being  tested.  Unlike  traditional  classification  schemes  of 
psychiatric  symptoms,  the  current  circuit-based  scheme  can  provide  straightforward  testable 
grounds  for  understanding  how  a  given  circuit  dysfunction  might  be  related  to  behavioral  deficits 
observed  in  psychiatric  conditions. 
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Neuronal  reference  frames  for  social  decisions 


Abstract 

1  Social  decisions  play  a  crucial  role  in  the  success  of  individuals  and  the  groups  they  compose. 

2  Group  members  respond  vicariously  to  benefits  obtained  by  others,  and  impairments  in  this 

3  capacity  contribute  to  neuropsychiatric  disorders  like  autism  and  sociopathy.  We  studied  how 

4  neurons  in  three  frontal  cortical  areas  encode  the  outcomes  of  social  decisions  as  monkeys 

5  performed  a  reward-allocation  task.  Neurons  in  the  orbitofrontal  cortex  (OFC)  predominantly 

6  encoded  rewards  delivered  to  oneself.  Neurons  in  the  anterior  cingulate  gyrus  (ACCg)  encoded 

7  reward  allocations  to  the  other  monkey,  reward  allocations  to  oneself,  or  both.  Neurons  in  the 

8  anterior  cingulate  sulcus  (ACCs)  signaled  reward  allocations  to  the  other  monkey  or  no  one. 

9  Within  this  network  of  received  (OFC)  and  foregone  (ACCs)  reward  signaling,  ACCg  emerges 

10  as  a  key  nexus  for  the  computation  of  shared  experience  and  social  reward.  Individual  and 

1 1  species-specific  variations  in  social  decision-making  might  result  from  the  relative  activation  and 

12  influence  of  these  areas. 
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1  Social  cohesion  depends  on  vicarious  identification  with  members  of  one’s  group.  In 

2  social  situations,  we  are  aware  of  our  actions  and  their  consequences  but  also  consider  those  of 

3  others,  especially  those  with  whom  we  might  interact1.  We  also  estimate  the  internal  states  of 

4  others,  perhaps  by  simulation2,  which  in  turn  shapes  our  future  actions.  Social  situations  can 

5  drive  observational  learning3,  and  other-regarding  preferences  influence  neural  computations  that 

6  ultimately  result  in  cooperation,  altruism,  or  spite4'5.  Disruptions  of  neural  circuits  involved  in 

7  other-regarding  processes  may  underlie  social  deficits  attending  neuropsychiatric  conditions  like 

8  autism6.  Human  imaging  and  clinical  studies  have  found  critical  links  between  social  deficits  and 

9  abnonnal  brain  activity  in  frontal  cortex  and  its  subcortical  targets7. 

10  Neural  circuits  involved  in  reinforcement  learning  and  decision-making  are  crucial  for 

11  nonnal  social  interactions8.  Critical  nodes  include  the  anterior  cingulate  cortex9-11,  the 

12  orbitofrontal  cortex12-17,  and  subcortical  areas  such  as  the  dopaminergic  ventral  tegmental  area 

13  and  substantia  nigra  ’  ,  the  striatum  -  ,  the  lateral  habenula",  and  the  amygdala"  . 

14  Neuro imaging  studies  in  humans  report  activation  of  some  of  these  areas  by  both  giving  rewards 

15  and  receiving  rewards24-28,  and  lesions  to  some  of  these  areas  result  in  impaired  social  decision- 

16  making7.  These  findings  thus  suggest  a  generic  circuit  for  reward-guided  learning  and  decision- 

17  making  mediates  social  decisions8.  Despite  this  evidence,  and  the  clear  clinical  relevance  of 

18  understanding  the  neurobiology  of  social  decision-making,  precisely  how  neurons  in  any  of  these 

19  areas  compute  social  decisions  remains  unknown,  largely  due  to  difficulties  in  implementing 

20  social  interactions  while  simultaneously  studying  neuronal  activity  and  controlling  contextual 

21  variables.  Single  unit  recording  studies  in  nonhuman  animals,  such  as  macaques,  making  social 

22  decisions  of  similar  complexity  to  those  made  by  humans  would  help  address  this  gap. 
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1  To  address  this  gap,  we  implemented  a  reward-allocation  task  in  pairs  of  rhesus 

2  macaques  while  at  the  same  time  recording  from  single  neurons  in  three  critical  nodes  in  the 

3  decision-making  network,  namely  the  anterior  cingulate  gyrus  (ACCg),  the  anterior  cingulate 

4  sulcus  (ACCs),  and  the  orbitofrontal  cortex  (OFC).  Our  study  capitalized  on  monkeys’ 

5  willingness  to  engage  with  a  social  partner  via  an  interposed  computer  system  while  at  the  same 

6  time  controlling  the  sensory  and  reward  environment.  We  specifically  matched  choices  for  the 

7  reward  outcomes  directly  received  by  the  actor  monkey  and  controlled  for  potential  secondary 

8  acoustic  reinforcement  effects  associated  with  delivering  juice  to  the  recipient  monkey  (see 

9  below).  Under  these  conditions,  we  found  regional  biases  in  the  encoding  of  social  decision 

10  outcomes  with  respect  to  self  and  another  individual.  Within  this  network  of  received  (OFC)  and 

1 1  foregone  (ACCs)  reward  signals,  ACCg  emerges  as  a  key  nexus  for  the  computation  of  shared 

12  experience  and  social  reward. 

13 

14  Summary  of  behavior  in  the  reward-allocation  task 

1 5  On  one  half  of  trials,  tenned  choice  trials,  actor  monkeys  chose  between  visual  stimuli 

16  that  led  to  juice  delivered  either  to  themselves  (s elf  reward),  to  the  recipient  monkey  ( other 

17  reward),  or  to  neither  monkey  ( neither  reward).  Offers  appeared  in  pairs  of  three  types,  which 

18  defined  Self:Neither  trials,  Self: Other  trials,  and  Other:Neither  trials  (Fig.  la-d).  On  the  other 

19  half,  termed  cued  trials,  monkeys  observed  a  single  cue  that  indicated  self,  other  or  neither 

20  rewards  would  be  delivered  by  the  computer,  as  defined  above. 

21  Actors  perfonned  the  task  well  (Fig.  2a),  as  indicated  by  the  low  mean  number  of 

22  incomplete  trials  per  session  (4.6  ±  0.2%  [s.e.m.])  (Online  Methods),  even  when  they  had  no 

23  chance  of  obtaining  juice  rewards  themselves,  which  was  the  case  for  Other:Neither  choice  trials 
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1  and  for  other  and  neither  cued  trials  (7.4  ±  0.3%).  Actors  also  made  significantly  fewer  errors 

2  when  they  made  active  decisions  (choice  trials)  then  when  there  was  no  choice  (cued  trials), 

3  when  there  was  no  reward  at  stake  for  themselves  ( P  <  0.0001,  Welch  two  sample  /-test).  These 

4  findings  suggest  monkeys  find  it  rewarding  to  actively  choose  what  to  do,  and  can  be  motivated 

5  to  work  without  direct  reinforcement. 

6  Reaction  times  often  serve  as  a  proxy  for  motivation  in  incentivized  tasks29-33.  Reaction 

7  times  for  making  different  choices  demonstrate  that  actors  discriminated  the  reward  types  and 

8  had  orderly  preferences  amongst  them29’33.  Actors  were  fastest  to  choose  self  rewards,  followed 

9  by  other  rewards  and  neither  rewards  (Fig.  2b).  Self  \ s.  other  reaction  times  differed  by  a  mean 

10  of  39  ms  (P  <  0.000 1 ;  Welch  two-sample  /-test);  other  vs.  neither  differed  by  a  mean  of  20  ms  (P 

11  <  0.0001).  The  ordered  reaction  times  by  monkeys  making  choices  in  the  reward  allocation  task 

12  suggest  that  rewarding  self  is  more  reinforcing  than  rewarding  the  recipient,  which  is  in  turn 

13  more  reinforcing  than  rewarding  no  one33. 

14  Finally,  actors  shifted  gaze  to  the  recipients  more  frequently  following  juice  delivery  to 

15  them  compared  to  juice  delivery  to  themselves  or  to  neither  monkey,  consistent  with  greater 

16  interest  in  the  actions  of  the  other  monkey  when  he  was  rewarded  (Supplementary  Figure  1). 

17  Taken  together,  these  observations  support  the  conclusion  that  actors  were  acutely  aware  of  the 

18  difference  between  self  other,  and  neither  reward  outcomes33. 

19  We  quantified  decision  preferences  by  calculating  a  contrast  ratio  based  on  actors’ 

20  choices  (Online  Methods  Eq.  1).  Consistent  with  our  previous  reports33'34,  actors  preferred  self 

21  rewards  over  other  or  neither  rewards,  but  preferred  other  over  neither  rewards  (Fig.  2c).  On 

22  Self: Neither  and  Self:  Other  trials,  actors  almost  always  chose  to  reward  self  (Fig.  2c)  (preference 

23  index  [mean  ±  s.e.m.]:  SelfNeither,  -0.99  ±  0.00;  SelfOther,  -0.99  ±  0.00;  significantly 
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1  different  from  zero:  both  P  <  0.0001,  one  sample  t-  test).  By  contrast,  on  Other  .'Neither  trials, 

2  actors  preferred  to  allocate  rewards  to  the  recipient  monkey  (Fig.  2c)  (0.17  ±  0.01;  P  <  0.0001, 

3  one  sample  /-test).  We  observed  similar  choice  preferences  for  each  actor  individually 

4  (Supplementary  Figure  2). 

5  We  previously  reported  that  the  preference  to  allocate  reward  to  the  other  monkey  is 

6  enhanced  by  greater  familiarity  between  the  two  animals,  and  is  abolished  if  the  recipient  is 

7  replaced  with  a  juice  collection  bottle33.  We  also  reported  that  reward  withholding  is  reduced 

8  when  actor  monkeys  are  dominant  toward  recipients,  and  the  variability  and  the  degree  of 

9  preferences  often  depend  on  the  identity  of  the  recipients33.  Furthermore,  we  reported  that  actor 

10  monkeys  prefer  to  deliver  juice  to  themselves  compared  to  both  themselves  and  the  recipient 

1 1  simultaneously,  perhaps  reflecting  the  competitive  nature  of  simultaneously  drinking  juice — a 

12  resource  controlled  outside  of  experimental  sessions  in  order  to  motivate  performance  and  often 

13  monopolized  by  dominant  monkeys  living  in  pairs  with  subordinate  monkeys  in  their  home 

14  cages33  (MLP,  personal  observation).  Finally,  exogenously  increasing  oxytocin  levels  in  the 

15  central  nervous  system  amplifies  actors’  preference  to  allocate  reward  to  the  other  monkey  over 

16  no  one34.  Taken  together,  these  patterns  of  behavior  endorse  the  fundamentally  social  nature  of 

17  the  reward-allocation  task. 

18  We  also  found  that  preferences  scaled  with  the  magnitude  of  juice  on  offer.  With  larger 

19  amounts  of  juice  at  stake,  actors  became  more  motivated  to  receive  ( Self: Neither  &  Self:  Other, 

20  slope  significantly  different  from  zero:  both  P  <  0.001,  type  II  regression)  and  also  to  allocate 

21  rewards  to  the  other  monkey  over  no  one  (Other: Neither,  P  <  0.05)  (Fig.  2d).  These  findings 

22  suggest  that  both  direct  and  vicarious  reinforcement  processes  that  motivate  social  decisions  are 

23  magnified  by  reward  magnitude25-27. 
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1 

2  Differential  encoding  of  social  decision  outcomes 

3  We  recorded  the  activity  of  single  neurons  in  ACCg  ( n  =  81),  ACCs  (n  =  101),  and  OFC 

4  (n  =  85)  from  two  actor  monkeys  (Fig.  3a)  during  the  reward-allocation  task.  We  describe 

5  neuronal  responses  from  typical  single  neurons  and  the  populations  below  for  each  region.  We 

6  analyzed  the  data  for  both  a  choice/cue  epoch  and  a  reward  epoch  (Online  Methods). 

7  Supplementary  Figure  3  shows  population  data  for  the  individual  monkeys.  For  brevity,  we 

8  focus  on  the  reward  epoch;  data  for  the  choice/cue  epoch  are  found  in  Supplementary  Figure  4, 

9  as  well  as  in  Figures  3  and  4.  Overall,  we  found  remarkable  resemblances  in  activity  and 

10  functional  classes  (see  below)  across  the  choice  and  reward  epochs. 

1 1  ACCg :  ACCg  contained  neurons  selective  for  allocating  rewards  to  another 

12  individual,  receiving  rewards,  or  both.  One  class  of  ACCg  neuron  (Fig.  3b)  preferentially 

13  responded  when  actors  chose  to  allocate  reward  to  recipients.  On  choice  trials,  this  example 

14  neuron  discharged  more  strongly  when  the  actor  chose  other  rewards  (7.12  ±  0.66  [mean  and 

15  s.e.m.]  spikes/s  [sp/s])  compared  to  self  rewards  on  either  Self:Neither  or  Self: Other  trials  (4.95  ± 

16  0.36,  4.93  ±  0.45  sp/s,  respectively)  (both  P  <  0.01,  Welch  two  sample  f-test),  and  also  preferred 

17  other  rewards  over  neither  rewards  (4.44  ±  0.79  sp/s,  P  <  0.05).  This  neuron  did  not  differentiate 

18  self  from  neither  rewards  ( P  =  0.97,  Welch  two  sample  /-test).  On  cued  trials,  this  neuron  only 

19  weakly  preferred  other  over  self  ox  neither  rewards  (both  P  =  0.08,  Welch  two  sample  /-test) 

20  (Fig.  3b). 

21  By  contrast,  another  class  of  ACCg  neuron  (example  neuron  in  Fig.  3c)  responded 

22  selectively  for  choosing  self  rewards.  The  example  neuron  in  Figure  3c  neuron  discharged  more 

23  when  the  actor  chose  to  reward  himself  on  SelfNeither  and  SelfOther  trials  (4.77  ±  0.38,  5.70  ± 
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1  0.41  sp/s,  respectively)  compared  to  choosing  other  and  neither  rewards  (2.02  ±  0.32,  1.60  ± 

2  0.39  sp/s,  respectively)  (all  P  <  0.0001,  Welch  two  sample  f-test).  Moreover,  it  showed  stronger 

3  responses  when  the  actor  received  rewards  in  Self:  Other  than  SelfNeither  context,  but  this  effect 

4  did  not  reach  statistical  significance  ( P  =  0.10,  Welch  two  sample  /-test).  On  cued  trials,  this 

5  neuron  preferred  self  over  other  or  neither  rewards  (both  P  <  0.0001,  Welch  two  sample  /-test). 

6  For  both  choice  and  cued  trials,  the  response  did  not  differentiate  other  and  neither  rewards  (both 

7  P  >  0.23,  Welch  two  sample  /-test). 

8  Finally,  a  third  class  of  ACCg  neuron  (example  neuron  in  Fig.  3d)  responded 

9  equivalently  to  both  received  rewards  ( SelfNeither ,  15.28  ±  0.70,  SelfOther,  16.47  ±  0.81  sp/s) 

10  and  allocated  rewards  to  other  (15.81  ±1.16  sp/s)  (both  P  >  0.64,  Welch  two  sample  f-test),  but 

11  responded  significantly  less  to  neither  rewards  (10.17  ±  1.23  sp/s;  other  vs.  neither  and  self  vs. 

12  neither,  both  P  <  0.005).  Similarly,  on  cued  trials,  this  neuron  preferred  other  over  neither 

13  rewards  (P  <  0.05,  Welch  two  sample  /-test),  but  did  not  differentiate  between  self  and  other 

14  rewards  ( P  =  0.27). 

15  Importantly,  the  fact  that  the  solenoid  valves  controlling  juice  delivery  (including  one  for 

16  neither  rewards  that  only  produced  clicks)  were  placed  outside  the  experimental  room,  as  well  as 

17  the  white  noise  played  inside  the  room,  during  sessions  rules  out  a  simple  explanation  that  other- 

18  reward  specific  (Fig.  3b)  and  shared  self  other  reward  responses  (Fig.  3d)  were  merely  sensory 

19  responses  to  the  sounds  of  the  reward-delivery  mechanism. 

20  To  contrast  population  coding  of  decision  and  reward  information  in  various  conditions, 

21  we  computed  a  nonnalized  activity  bias  between  each  pair  of  outcomes,  expressed  as  a 

22  proportional  modulation  in  mean  firing  rates  nonnalized  by  baseline  firing  rate.  In  the  ACCg 

23  population,  the  mean  normalized  activity  bias  for  other  over  neither  rewards  ( other  vs.  neither) 
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1  was  0.21  ±  0.10  (s.e.m.),  i.e.,  a  21%  difference,  which  was  significant  (P  <  0.05,  paired  /-test) 

2  (Fig.  3e,  5a).  Similarly,  the  bias  for  .sc//' (from  Self:Other )  over  neither  rewards  was  0.20  ±  0.12 

3  (P  =  0.09,  paired  t-test).  Notably,  the  population  showed  equivalent  responses  for  self  rewards 

4  {Self: Other)  and  other  rewards  (0.01  ±  0.12,  P  =  0.96,  paired  /-test).  On  the  other  hand,  it  showed 

5  a  significant  bias  for  self  rewards  when  the  actors  were  presented  with  a  choice  between 

6  rewarding  themselves  and  recipients  compared  to  when  the  actors  were  presented  with  a  choice 

7  between  rewarding  themselves  and  no  one  {SelfOther  vs.  Self:Neither,  by  0.17  ±  0.08,  P  <  0.05, 

8  paired  /-test),  suggesting  that  ACCg  is  particularly  sensitive  to  a  reward  context  involving  an 

9  option  to  reward  another  individual.  Thus,  the  ACCg  population  showed  an  equivalent 

10  preference  for  other  and  self  rewards,  and  preferred  both  over  neither  rewards. 

1 1  On  cued  trials,  however,  a  strikingly  different  pattern  emerged.  The  population  responded 

12  strongly  to  self  rewards  but  barely  responded  to  other  rewards  (0.59  ±  0.32,  P  =  0.07,  paired  /- 

13  test)  (Fig.  3e).  Furthennore,  the  population  now  responded  no  differently  to  other  and  neither 

14  rewards  (0.22  ±  0. 14,  P  =  0. 14,  paired  /-test). 

15  Taken  together,  these  results  indicate  that  ACCg,  as  a  population,  encodes  both  giving 

16  and  receiving  rewards.  At  the  population  level,  neuronal  activity  selective  for  allocating  rewards 

17  to  another  individual  is  specific  to  active  decisions  (upper  vs.  lower:  Fig.  3e),  similar  to  what  has 

18  been  reported  by  fMRI  of  human  ventral  striatum  during  voluntary  versus  forced  charitable 

19  donations25.  The  confluence  of  neurons  selectively  responsive  to  self,  other,  and  both  {self  and 

20  other)  rewards  in  ACCg  suggests  this  area  contains  the  information  necessary  to  mediate  the 

21  vicarious  reinforcement  processes  that  appear  to  motivate  actors  to  give  to  recipients. 

22  ACCs :  Fig.  4a  shows  a  typical  ACCs  neuron  that  fired  more  strongly  preceding  other 

23  and  neither  rewards  than  self  rewards.  On  choice  trials,  this  neuron  discharged  more  strongly 
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1  when  the  actor  chose  not  to  reward  himself  ( other  rewards,  19.64  ±  2.15;  neither  rewards,  18.19 

2  ±  2.03  sp/s)  compared  to  when  he  chose  to  reward  himself  directly  {Self: Neither,  10.31  ±  0.86; 

3  Self: Other,  9.79  ±  0.81  sp/s)  (all  P  <  0.001,  Welch  two  sample  /-test).  This  neuron  responded 

4  equivalently  to  self  rewards  in  SelfOther  and  SelfNeither  contexts  (P  =  0.66,  Welch  two  sample 

5  /-test),  and  also  responded  equivalently  to  other  and  neither  rewards  (P  =  0.62),  consistent  with 

6  encoding  “foregone”  rewards.  On  cued  trials,  this  neuron  responded  equivalently  to  other  and 

7  neither  rewards  ( P  =  0.39,  Welch  two  sample  /-test),  but  less  to  self  rewards  (both  P  <  0.005), 

8  resembling  the  responses  to  active  decisions. 

9  Likewise,  the  ACCs  population  showed  a  strong  and  equivalent  response  bias  for 

10  foregone  rewards  (self  v  s.  other,  activity  bias=0.31  ±  0.07;  self  vs.  neither,  activity  bias=0.25  ± 

1 1  0.08,  both  P  <  0.005,  paired  /-test)  (Fig.  4c,  5b).  The  population  did  not  differentiate  other  from 

12  neither  rewards  (0.06  ±  0.06,  P  =  0.31,  paired  /-test).  Unlike  ACCg,  the  population  did  not 

13  respond  differentially  to  SelfOther  and  SelfNeither  contexts  (differed  by  0.003  ±  0.02,  P  =  0.90, 

14  paired  t-test).  We  found  similar  patterns  on  cued  trials  -  responses  to  self  rewards  were 

15  substantially  reduced  compared  to  other  rewards  (0. 19  ±  0.09,  P  <  0.05,  paired  /-test)  and  neither 

16  rewards  (0.18  ±  0.10,  P  <  0.08)  (Fig.  4c).  These  results  indicate  that,  during  social  interactions, 

17  ACCs  neurons  predominantly  signal  foregone  rewards. 

18  OFC :  Fig.  4b  shows  a  typical  OFC  neuron  that  preferentially  encoded  juice  rewards 

19  received  by  the  actor.  On  choice  trials,  this  neuron  discharged  significantly  more  for  self  rewards 

20  than  for  the  alternatives  on  both  SelfNeither  and  SelfOther  trials.  Activity  for  self  rewards  did 

21  not  differ  between  the  two  self  reward  contexts  (7.00  ±  0.47,  7.03  ±  0.46  sp/s,  respectively,  P  = 

22  0.97,  Welch  two  sample  /-test),  but  it  exceeded  the  cell’s  activity  for  other  and  neither  rewards 

23  (3.06  ±  0.40,  1.85  ±  0.42  sp/s,  respectively;  both  P  <  0.0001).  On  cued  trials,  this  neuron 
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1  responded  most  strongly  to  self  rewards  compared  to  both  other  and  neither  rewards  (both  P  < 

2  0.0001,  Welch  two  sample  /-test),  but  it  did  not  respond  differently  between  other  and  neither 

3  rewards  (P  =  0.25)  (Fig.  4b). 

4  The  OFC  population  predominantly  encoded  self  rewards  compared  to  other  and  neither 

5  rewards.  The  bias  for  self  over  other  rewards  was  30%  (0.30  ±  0.09,  P  <  0.005,  paired  /-test).  For 

6  self  versus  neither  rewards,  the  bias  was  also  significant  (0.17  ±  0.08,  P  <  0.05,  paired  /-test) 

7  (Fig.  4d,  5c).  Population  activity  for  other  and  neither  rewards  did  not  differ  (0.08  ±  0.06,  P  = 

8  0.20,  paired  /-test)  (Fig.  4d,  5c).  Unlike  ACCg,  the  population  did  not  respond  differentially  to 

9  SelfOther  and  SelfNeither  contexts  (differed  by  0.06  ±  0.07,  P  =  0.39,  paired  t-test).  On  cued 

10  trials,  the  self  reward  bias  was  not  present  compared  to  other  rewards  (0.19  ±  0.16,  P  =  0.24, 

11  paired  /-test)  and  only  weakly  present  over  neither  rewards  (0.26  ±  0.15,  P  <  0.08).  On  cued 

12  trials,  the  population  did  not  distinguish  other  rewards  from  neither  rewards  ( P  =  0.33,  paired  /- 

13  test)  (Fig.  4d).  These  results  indicate  that  OFC  neurons  predominantly  encode  rewards  received 

14  by  the  actors,  and  this  information  was  encoded  more  faithfully  during  active  decision-making. 

15 

16  Neuronal  reference  frames  for  social  decisions 

17  Neuro imaging  and  scalp-recording  studies  in  humans  can  only  study  neuronal  activity  at 

18  an  aggregate  level.  Our  single-unit  recording  data  thus  provide  a  unique  opportunity  to  quantify 

19  the  frame  of  reference  in  which  individual  neurons  within  ACCg,  ACCs,  and  OFC  encode  social 

20  decisions.  To  do  this,  we  classified  cells  from  each  area  based  on  an  analysis  of  variance 

21  (ANOVA)  of  neuronal  activity  of  individual  neurons  with  reward  outcome  {self  other,  or 

22  neither),  trial  type  (choice  or  cued),  and  reward  magnitude  (small,  medium,  or  large)  as  factors 

23  (Online  Methods).  Reward  epoch  responses  differed  significantly  for  a  large  number  of  neurons 
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1  from  all  areas  in  a  manner  that  depended  on  reward  outcome  (ACCg:  57%,  ACCs:  72%,  OFC: 

2  57%),  trial  type  (ACCg:  36%,  ACCs:  52%,  OFC:  45%)  and  reward  volume  (ACCg:  12%,  ACCs: 

3  25%,  OFC:  24%)  (Supplementary  Table  1).  Furthermore,  we  observed  remarkable 

4  resemblances  in  reward  outcome  coding  across  the  choice/cue  and  reward  epochs 

5  (Supplementary  Figure  4). 

6  Based  on  the  statistical  significance  of  the  ANOVA  during  the  choice/cue  and  reward 

7  epochs,  we  classified  individual  neurons  as  self-referenced  (i.e.,  modulation  referenced  to  self 

8  rewards,  preferring  either  self  or  foregone  rewards),  other-referenced  (i.e.,  modulation  referenced 

9  to  other  rewards),  both-referenced  (i.e.,  modulation  referenced  to  both  self  and  other  rewards,  but 

10  not  neither  rewards),  or  unclassified  (Online  Methods).  Here  we  consider  the  proportion  of 

1 1  different  cell  types  among  the  classified  neurons  based  on  this  scheme.  In  OFC,  80%  (n  =  36/45) 

12  were  self-referenced,  whereas  only  9%  (4/45)  were  other-referenced  and  11%  (5/45)  were  both- 

13  referenced  (both  P  <  0.0001,  y  test).  In  ACCs,  72%  (51/71)  were  self-referenced,  whereas  only 

14  14%  (10/71)  were  other-referenced  and  14%  (10/71)  were  both-referenced  (both  P  <  0.0001,  y 

15  test)  (Fig.  5d).  In  contrast,  ACCg  contained  similar  proportions  that  were  self-referenced  (38%, 

16  12/32),  other-referenced  (31%,  10/32),  and  both-referenced  (31%,  10/32)  (P  >  0.79,  y2  test). 

17  Critically,  ACCg  contained  a  significantly  higher  proportion  of  neurons  (>60%)  that  were 

18  sensitive  to  the  reward  outcome  of  the  recipient  monkey  (i.e.,  other-referenced  and  both- 

19  referenced)  compared  to  either  OFC  or  ACCs  (both  P  <  0.005,  yj  test)  (Fig.  5d).  ACCg  also 

20  contained  a  significantly  smaller  proportion  of  self-referenced  neurons  than  either  OFC  or  ACCs 

21  (both  P  <  0.005,  y  test).  Finally,  we  found  similar  results  when  we  repeated  the  analysis  by 

22  including  trial-by-trial  choice  reaction  times  as  covariates  (Supplementary  Figure  5). 
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1  To  test  whether  different  neuronal  frames  of  reference  (self-,  other-,  and  both-referenced) 

2  were  anatomically  segregated,  we  used  principal  component  analysis  on  recording  coordinates  to 

3  identify  the  major  axis  with  the  largest  dispersion  within  three-dimensional  space.  We  then 

4  projected  neurons  to  that  axis  to  test  differential  distributions  in  individual  monkeys  separately. 

5  Figure  6  shows  reconstructed  recording  locations  for  each  reference  frame  class  for  each  area. 

6  We  did  not  observe  any  systematic  anatomical  clustering  amongst  different  frames  of  reference: 

7  self-,  other-,  and  both-referenced  neurons  within  ACCg,  ACCs,  and  OFC  were  intermingled  (all 

8  P  >  0.56,  Wilcoxon  rank  sum  test). 

9  Next  we  examined  whether  differential  encoding  of  self,  other,  and  neither  rewards  was 

10  also  present  prior  to  making  a  decision.  We  found  very  little  evidence  for  systematic  signals 

1 1  early  in  the  trial  just,  after  target  onset  (from  50ms  to  250ms  from  target  onset).  In  ACCg,  only  0, 

12  3,  and  1  cells  were  classified  into  self-,  other-,  and  both-referenced  classes  with  only  12%  of 

13  neurons  showing  significant  effect  of  reward  type.  In  ACCs,  only  1,  2,  and  3  cells  belonged  to 

14  each  category,  with  only  22%  of  the  neurons  with  significant  reward  type  effects.  Similarly,  in 

15  OFC,  only  2,  2,  and  4  cells  belonged  to  each  category,  with  only  28%  of  the  neurons  with 

16  significant  reward  type  effects.  Thus,  in  our  reward  allocation  task,  signals  in  ACCg,  ACCs,  and 

17  OFC  appear  to  emerge  around  the  time  of  choice  and  reward  delivery. 

18  Finally,  we  examined  whether  session-to-session  variation  in  prosocial  tendencies  on 

19  Other  .Neither  trials  (Fig.  2c)  could  be  explained  by  variability  in  the  responses  of  ACCg 

20  neurons — the  population  most  sensitive  to  other’s  rewards.  We  split  recording  sessions  based  on 

21  actors’  choices  on  Other :Neither  into  two  categories:  more  prosocial  (higher  other  over  neither 

22  choices  relative  to  the  median  preference  index)  and  less  prosocial  (lower  other  over  neither 

23  choices  relative  to  the  median  preference  index).  Actors  tended  to  be  more  prosocial  on 
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1  recording  sessions  when  other-referenced  and  both-referenced  ACCg  neurons  showed  less 

2  variability  in  spiking  during  the  reward  epoch  (P  <  0.05,  bootstrap  test)  (Fig.  7a).  By  contrast, 

3  we  found  that  self-referenced  ACCg  neurons  generated  more  variable  responses  during  the 

4  reward  epoch  when  actors  were  more  prosocial  (P  <  0.05,  bootstrap  test).  ACCs  neurons  did  not 

5  show  any  systematic  relationship  between  response  variance  and  behavior  ( P  =  0.47,  bootstrap 

6  test;  Fig.  7b).  Notably,  OFC  neurons  showed  a  similar  pattern  to  self-referenced  ACCg  neurons 

7  (P  <  0.005,  bootstrap  test;  Fig.  7c).  These  findings  reveal  suggest  a  strong  link  between  prosocial 

8  behavior  and  the  fidelity  of  social  reward  signals  carried  by  those  neurons  that  incorporate  the 

9  experience  of  others  into  their  responses.  This  could  be  due  to  enhanced  attention  to  the  recipient 

10  or  other  processes  known  to  influence  signal  to  noise  in  cortical  neurons. 

11 

12  Discussion 

13  Our  findings  strongly  endorse  the  hypothesis  that  distinct  frontal  regions  contribute 

14  uniquely  to  social  decisions  by  differentially  processing  decision  outcomes  with  respect  to  actors 

15  ( self)  and  their  partners  {other).  The  finding  that  OFC  neurons  selectively  encode  self  reward  is 

16  consistent  with  previous  studies  implicating  this  area  in  representing  the  subjective  value  of 

17  rewards12,13,  but  extend  those  findings  by  demonstrating  that  such  value  signals  are  encoded 

18  egocentrically.  Encoding  of  foregone  rewards  by  ACCs  neurons,  on  the  other  hand,  is  consistent 

19  with  previous  data  implicating  this  area  in  error  monitoring  and  behavioral  adjustment35-37.  For 

20  example,  foregone  reward  signaling  by  ACCs  might  be  used  to  learn  from  observation,  rather 

21  than  direct  experience,  and  adjust  ongoing  behavior  during  social  interactions.  Furthennore, 

22  mirroring  of  self  and  other  rewards  by  ACCg  neurons  is  consistent  with  previous  studies  linking 

23  this  area  to  specifically  social  functions  like  shared  experience  and  empathy38. 
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1  Our  findings  echo  those  of  a  previous  study  examining  the  effects  of  lesions  in  these 

2  same  brain  regions  (Online  Methods),  which  demonstrated  that  ACCg,  but  not  OFC  or  ACCs, 

3  contributes  causally  to  the  use  of  visual  social  infonnation  to  guide  behavior9.  Specifically, 

4  ACCg  lesions  completely  abolished  typical  hesitation  to  retrieve  food  when  confronted  with 

5  social  stimuli9.  Our  findings  also  agree  with  previous  findings  that  lesions  in  ACCs  impair  the 

6  use  of  reward  history  to  guide  decisions  adaptively10.  Differences  between  ACCs  and  ACCg 

7  reported  here  support  and  extend  the  finding  that  learning  based  on  experience  is  mediated  by 

8  ACCs,  whereas  learning  from  feedback  from  another  individual  is  mediated  by  ACCg8. 

9  Specifically,  in  a  learning  task  in  which  human  subjects  monitored  their  history  of  correct 

10  responses  as  well  as  the  advice  given  to  them  by  a  confederate,  BOLD  activation  in  ACCs 

1 1  tracked  reward  learning  rate,  whereas  BOLD  activation  in  ACCg  tracked  social  learning  rate 

12  based  on  advice  from  the  confederate8.  In  our  study,  we  propose  that  ACCs  tracked  foregone 

13  rewards  relative  to  self,  whereas  ACCg  tracked  reward  outcomes  of  another  individual  in  a  more 

14  complex  manner. 

15  Intriguingly,  the  ACCg  population  also  responded  more  strongly  when  monkeys  chose 

16  self  reward  when  the  alternative  was  allocating  reward  to  the  other  monkey  compared  to  the 

17  response  when  monkeys  chose  self  reward  when  the  alternative  was  rewarding  no  one.  In 

18  contrast,  neither  the  OFC  neuronal  population  response  nor  the  ACCs  neuronal  population 

19  response  was  sensitive  to  social  context  when  monkeys  rewarded  themselves.  Sensitivity  to 

20  social  context  in  ACCg  endorses  a  specialized  role  for  this  area  in  computing  social  decisions  - 

21  even  when  one  acts  selfishly. 

22  It  is  worthwhile  to  note  that  a  small  number  of  ACCs  and  OFC  neurons,  though  much 

23  less  in  proportion  compared  to  ACCg  (Fig.  5d,  Supplementary  Table  1,  Supplementary 
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1  Figure  5),  were  classified  as  either  other-  or  both-referenced.  This  observation  supports  the  idea 

2  that  a  small  number  of  ACCs  and  OFC  neurons  do  carry  information  about  rewards  allocated  to 

3  another  individual.  What  is  striking  here  is  that  the  majority  of  OFC  and  ACCs  neurons  (80% 

4  and  72%,  respectively)  do  not  carry  such  other-regarding  information  (other-  or  both- 

5  referenced),  whereas  the  majority  of  ACCg  neurons  do  so  (62%).  This  endorses  a  fundamentally 

6  social  role  for  neurons  in  ACCg. 

7  A  prior  study  showed  that  OFC  neurons  modulate  their  activity  when  a  monkey  receives 

8  juice  reward  together  with  another  individual39,  suggesting  that  value  signals  in  OFC  are 

9  sensitive  to  social  context.  In  that  study,  OFC  neurons  responded  differentially  as  a  function  of 

10  whether  the  subject  monkey  received  juice  rewards  alone  or  together  with  another  monkey39.  Our 

1 1  current  study  builds  upon  and  extends  those  findings  in  three  important  ways.  First,  we  used  a 

12  free-choice  task  that  allowed  us  to  infer  the  subjective  value  of  rewards  delivered  to  self,  other, 

13  and  no  one.  Remarkably,  even  in  a  social  context  OFC  neurons  were  selective  for  self  reward, 

14  the  most  preferred  outcome.  Second,  we  compared  the  responses  of  OFC  neurons  to  responses  of 

15  neurons  in  ACCg  and  ACCs  recorded  in  identical  task  conditions,  allowing  us  to  demonstrate 

16  regional  differences  in  the  encoding  of  social  reward  infonnation  in  primate  frontal  cortex. 

17  Third,  when  we  compared  responses  of  ACCg  neurons  on  free-choice  and  cued  trials  we  found 

18  that  responses  to  rewards  delivered  to  the  recipient  monkey  were  largely  absent  when  actors 

19  passively  observed  the  event  rather  than  actively  choosing  it.  Taken  together,  these  extensions 

20  demonstrate  that  social  context  can  impact  the  encoding  of  reward  infonnation  in  all  three  areas: 

21  OFC  appears  to  be  implicated  with  the  evaluation  of  personally  experienced  rewards,  ACCs 

22  evaluates  reward  information  that  is  not  directly  experienced,  and  ACCg  multiplexes  information 
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1  about  the  direct  experience  of  reward  and  vicarious  reinforcement  experienced  by  allocating 

2  reward  to  another  individual. 

3  It  is  noteworthy  that  ACCs  neurons  showed  much  less  modulation  by  actors’  received 

4  reward  outcomes  compared  to  OFC  neurons.  This  is  striking  since  ACCs  neurons  often  show 

5  substantial  modulation  to  received  reward  in  nonsocial  settings11.  ACCg,  on  the  other  hand, 

6  contains  neurons  that  compute  reward  signals  in  both  other  and  self  frames  of  reference. 

7  Together,  our  findings  suggest  that,  as  in  sensory  and  motor  systems40,  identifying  the  frames  of 

8  reference  in  which  reward  outcomes  are  encoded  may  be  important  for  understanding  the  neural 

9  mechanisms  underlying  social  decision-making8. 

10  Accumulating  evidence  endorses  a  special  role  for  the  medial-frontal  cortex  in 

1 1  representing  infonnation  about  another  individual8,41  44.  For  instance,  perceived  similarity  while 

12  observing  others  is  correlated  with  hemodynamic  response  in  the  subgenual  ACC44.  Further,  a 

13  group  of  neurons  in  the  primate  medial-frontal  cortex  selectively  responds  to  observing  actions 

14  performed  by  other  individuals41.  Such  other-referenced  signals,  however,  are  not  limited  to  the 

15  medial  wall  of  the  frontal  cortex.  Neurons  in  the  dorsolateral  prefrontal  cortex  (DLPFC)  track  the 

16  behavior  of  a  computer  opponent  in  an  interactive  game45,  and  BOLD  responses  in  DLPFC  and 

17  ventromedial  prefrontal  cortex  (vmPFC)  during  observational  learning  track  observed  action  and 

18  observed  reward  prediction  errors,  respectively46.  Furthermore,  BOLD  activity  in  anterior  frontal 

19  areas  tracks  preferences  to  donate  to  charity24.  Brain  networks  involved  in  mentalizing47, 

20  vicarious  pain  perception48  and  empathy49  thus  seem  to  be  critical  for  mediating  social 

21  interactions,  suggesting  that  other-regarding  cognition  is  orchestrated  by  a  distributed  network  of 

22  frontal  cortical  areas. 
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1  Social  and  emotional  behaviors  are  highly  idiosyncratic  among  individuals. 

2  Understanding  the  neural  mechanisms  that  drive  such  individual  differences  remains  one  of  the 

3  most  pressing  issues  in  neuroscience.  We  hypothesize  that  the  differential  activation  of  neurons 

4  in  ACCg,  ACCs,  and  OFC  contribute  to  individual  and,  perhaps  species,  differences  in  social 

5  function. 
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1  Figure  Legends 

2 

3  Figure  1  Reward-allocation  task,  (a)  Experimental  setup  for  an  actor  and  a  recipient  monkey,  (b) 

4  Stimulus-reward  outcome  mappings  for  reward  delivered  to  actor  (self),  recipient  (other),  or  no 

5  one  (neither),  shown  separately  for  each  actor,  (c)  Magnitude  cue  used  to  indicate  juice  amount 

6  at  stake  for  each  trial  (see  d).  Position  of  the  horizontal  bisecting  line  specified  the  percentage  of 

7  maximum  reward  possible,  (d)  Task  structure  (see  Online  Methods).  Top  fork,  cued  trials; 

8  bottom  fork,  choice  trials.  Dashed  gray  lines  show  the  angle  of  the  actor’s  gaze,  converging  on 

9  the  fixation  point.  Eye  cartoons  indicate  times  when  the  actor  could  look  around.  RT,  reaction 

10  time.  MT,  movement  time.  ITI,  inter-trial  interval. 

11 

12  Figure  2  Behavior  in  the  reward-allocation  task,  (a)  Proportions  of  incomplete  trials  (mean  ± 

13  s.e.m.)  (see  Online  Methods)  during  the  reward-allocation  task,  (b)  Choice  reaction  times  (ms) 

14  from  trials  in  which  rewards  were  chosen  for  self,  other,  or  neither  (mean  of  session  medians  ± 

15  s.e.m).  (c)  Choice  preferences  (preference  index,  mean  ±  s.e.m.)  as  a  function  of  reward  outcome 

16  contrasts.  Data  points  next  to  each  bar  show  means  for  individual  sessions.  The  degree  of 

17  preference  axis  on  the  right  shows  the  range  of  preference  indices  in  ratio  tenns.  (d)  Choice 

18  preferences  (mean  ±  s.e.m.)  as  a  function  of  reward  magnitude  on  219  single-unit  sessions 

19  collected  with  the  magnitude  cue. 

20 

21  Figure  3  Single  neurons  and  population  responses  from  ACCg.  (a)  Structural  magnetic 

22  resonance  image  from  actor  MO,  with  example  electrode  paths  for  ACCg,  ACCs  and  OFC.  (Also 

23  see  Fig.  6.)  (b)  Mean  responses  (peri-stimulus  time  histograms  [PSTHs])  and  spike  rasters  for  an 


Chang,  Gariepy  &  Platt  20 


Neuronal  reference  frames  for  social  decisions 


1  other- reward  preferring  ACCg  neuron,  on  choice  trials  (upper,  solid  traces)  and  cued  trials 

2  (lower,  dashed  traces).  Data  are  aligned  to  choice/cue  offset  (left)  and  reward  onset  (right)  for 

3  each  reward  outcome.  Bar  histograms  on  right  show  mean  ±  s.e.m.  activity  from  the  two  epochs 

4  (grey  regions).  Color  codes  for  PSTH  traces  and  histograms  shown  below,  (c)  PSTHs  and  spike 

5  rasters  for  a  self- reward  preferring  ACCg  neuron,  (d)  PSTHs  and  spike  rasters  for  a  shared  self 

6  and  other  reward  preferring  ACCg  neuron,  (e)  Normalized  choice/cue  epoch  and  reward  epoch 

7  responses  for  81  ACCg  neurons,  c-e,  same  fonnat  as  in  b.  In  all  bar  histogram  insets,  the 

8  horizontal  lines  above  different  conditions  indicate  significance  differences  (solid,  P  <  0.05  by 

9  paired  /-test;  dashed,  P  <  0.05  by  bootstrap  test). 

10 

1 1  Figure  4  Single  neurons  and  population  responses  from  ACCs  and  OFC.  (a)  PSTHs  and  spike 

12  rasters  for  a  single  ACCs  neuron  preferring  forgone  rewards.  Data  are  aligned  to  choice/cue 

13  offset  (left)  and  reward  onset  (right)  for  each  reward  outcome.  Bar  histograms  on  right  show 

14  mean  ±  s.e.m.  activity  from  the  two  epochs  (grey  regions),  (b)  PSTHs  and  spike  rasters  for  a 

15  single  OFC  neuron  preferring  self  reward,  (c)  Normalized  reward  epoch  responses  of  101  ACCs 

16  neurons,  (d)  Normalized  choice/cue  epoch  and  reward  epoch  responses  of  85  OFC  neurons.  All 

17  panels,  same  format  as  in  Fig.  3. 

18 

19  Figure  5  Population  biases  for  self  other,  and  neither  rewards.  Scatter  plots  show  mean 

20  nonnalized  reward  epoch  responses  (proportion  of  modulation  relative  to  baseline)  of  individual 

21  neurons  (from  left  to  right)  between  self  {Self: Other)  and  other  rewards,  between  other  and 

22  neither  rewards,  between  self  rewards  from  SelfNeither  and  Self:  Other  contexts,  and  between 

23  self  ( SelfNeither )  and  neither  rewards,  for  ACCg  (a),  ACCs  (b),  and  OFC  (c)  populations. 
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1  Regression  lines  (type  II)  are  shown  in  red  (the  circled  data  points  are  excluded  from  the 

2  regression).  Unity  lines  are  shown  in  black.  The  example  neurons  from  Fig.  3,4  are  indicated  on 

3  the  scatter  plots,  (d)  Proportion  of  neurons  (out  of  significantly  classified  neurons)  from  OFC, 

4  ACCs,  and  ACCg  using  self-referenced,  other-referenced,  and  both-referenced  frames  to 

5  represent  reward  outcomes.  Inset  shows  color  codes  used  in  the  bar  graph.  Bars  indicate 

6  significant  differences  in  proportions  (P  <  0.05,  /test). 

7 

8  Figure  6  Anatomical  projections  of  recorded  locations  of  all  ACCg,  ACCs,  and  OFC  cells. 

9  Recording  sites  were  transformed  from  chamber  coordinates  into  interaural  coordinates.  The 

10  interaural  coordinates  of  individual  cells  from  both  monkeys  were  then  projected  onto  standard 

1 1  stereotaxic  maps  of  rhesus  monkeys50,  with  a  2  mm  interaural  spacing  in  the  anterior-posterior 

12  dimension.  Cells  are  shown  on  coronal  slices  and  color-coded  for  the  types  of  frames  of 

13  reference  used,  as  specified  in  Supplementary  Table  1  (see  box).  The  lateral  view  of  the  brain 

14  (inset)  shows  the  locations  of  the  coronal  sections,  cgs,  cingulate  sulcus;  ps,  principal  sulcus; 

15  morb,  medial  orbito frontal  sulcus;  lorb,  lateral  orbito frontal  sulcus;  ros,  rostral  sulcus;  Cd, 

16  caudate. 

17 

18  Figure  7  Prosocial  behavior  and  the  fidelity  of  neuronal  responses  on  Other :Neither  trials,  (a) 

19  ACCg;  (b)  ACCs;  (c)  OFC.  Coefficients  of  variation  in  firing  rate  (CV;  Online  Methods)  during 

20  the  reward  epoch  on  other  reward  trials  are  plotted  as  a  function  of  whether  actors  were  more  or 

21  less  prosocial  on  Other :Neither  trials  based  on  median  split  (higher:  preference  index  greater 

22  than  median;  lower:  preference  index  less  than  median).  Asterisks  indicate  P  <  0.05  by  bootstrap 

23  test. 
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Online  Methods 

1  General  and  behavioral  procedures 

2  All  procedures  were  approved  by  the  Duke  University  Institutional  Animal  Care  and  Use 

3  Committee,  and  were  conducted  in  compliance  with  the  Public  Health  Service’s  Guide  for  the 

4  Care  and  Use  of  Laboratory  Animals. 

5  Two  actor  (MY  and  MO)  and  five  recipient  monkeys  ( Macaca  mulatto)  participated.  For 

6  all  monkeys,  a  sterile  surgery  was  performed  to  implant  a  head-restraint  prosthesis  (Crist 

7  Instruments)  using  standard  techniques11.  Six  weeks  after  surgery,  monkeys  were  trained  on  a 

8  standard,  center-out,  oculomotor  task  for  liquid  rewards.  Actor  monkeys  were  then  trained  on  the 

9  reward-allocation  task  (Fig.  la-d)  in  the  presence  of  a  recipient.  Subsequently,  a  second  surgery 

10  was  performed  on  actors  to  implant  a  recording  chamber  (Crist)  providing  access  to  both  the 

1 1  sulcal  and  gyral  regions  of  the  anterior  cingulate  cortex  (ACCs  and  ACCg,  respectively)  and  the 

12  orbitofrontal  cortex  (OFC).  All  surgeries  were  performed  under  isoflourane  anesthesia  (1-3%), 

13  and  the  recording  chambers  were  regularly  cleaned,  treated  with  antibiotics  and  sealed  with 

14  sterile  caps. 

15  Horizontal  and  vertical  eye  positions  were  sampled  at  1000Hz  using  an  infrared  eye 

16  monitor  camera  system  (SR  Research  Eyelink).  Stimuli  were  controlled  by  PsychToolBox  and 

17  Matlab  (Mathworks).  Actors  and  recipients  sat  in  primate  chairs  (Crist),  100cm  from  one  another 

18  at  a  45-degree  angle  (Fig.  la).  Actors  (both  males)  and  recipients  (four  males,  one  female)  were 

19  unrelated  and  not  cagemates.  Different  pairs  were  selected  depending  on  the  availability  of 

20  recipient  monkeys.  Actors  were  housed  in  a  colony  with  12  other  male  rhesus  macaques,  some  of 

21  which  were  pair-housed.  All  the  male  monkeys  resided  in  this  colony  room,  and  the  one  female 

22  monkey  resided  in  the  adjacent  colony  room  with  other  females.  Out  of  the  total  seven  actor- 
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1  recipient  pairs  tested  in  the  current  study,  the  actor  monkey  was  dominant  over  the  recipient  in 

2  six  cases.  Furthermore,  three  pairs  could  be  classified  as  “more  familiar”  with  one  another 

3  because  their  cages  faced  each  other,  as  defined  previously”.  Based  on  these  relationships,  we 

4  would  expect  a  mixture  of  prosocial  and  competitive  preferences  based  on  our  prior  results 

5  showing  dominant  actors  are  slightly  less  competitive  than  subordinates,  but  also  showing  that 

6  pairs  in  which  the  actor  is  less  familiar  with  the  recipient  are  slightly  less  prosocial  than  when 

7  they  are  more  familiar. 

8  In  the  experimental  setup,  each  monkey  had  his  own  monitor  which  displayed  identical 

9  visual  stimuli.  Both  the  actor  and  recipient  monkeys  had  their  own  tube  from  which  juice  drops 

10  were  delivered.  In  order  to  prevent  monkeys  from  forming  secondary  associations  of  solenoid 

1 1  valve  clicks  or  the  sound  of  the  recipient  drinking  the  juice  reward  with  respect  to  different 

12  reward  types,  the  solenoid  valves  that  delivered  the  juice  rewards  were  placed  in  another  room 

13  and  white  noise  was  also  played  in  the  background.  Experimenters  were  unable  to  hear  solenoids 

14  anywhere  inside  the  recording  room.  Our  control  of  the  acoustic  environment  explicitly  rules  out 

15  a  simple  explanation  that  both-referenced  reward  encoding  found  in  ACCg  is  a  product  of  such 

16  secondary  sensory  associations.  Critically,  a  separate  solenoid  (also  placed  in  another  room)  was 

17  designated  for  neither  rewards;  it  only  produced  clicks  but  delivered  no  fluid. 

18  The  face  region  of  the  recipient,  with  respect  to  the  gaze  angle  of  the  actor  (horizontal 

19  and  vertical  eye  positions),  was  detennined  empirically  prior  to  the  experiments.  The  frequency 

20  with  which  actors  looked  at  recipients  was  computed  from  number  of  gaze  shifts  to  the 

21  recipient’s  face  (±8.5°  from  the  center  of  the  face)33'34.  We  used  a  large  window  to  capture  gaze 

22  shifts  that  were  brief  in  duration  and  large  in  magnitude  and  often  directed  at  varying  depths 

23  (e.g.,  eyes,  mouth)  (Fig.  la). 
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1  Monkeys  performed  the  task  to  obtain  drops  of  cherry-  or  orange-flavored  juice.  Actors 

2  began  a  trial  by  shifting  gaze  (±2.5°)  to  a  central  stimulus  (0.5°  x  0.5°),  and  maintained  fixation 

3  (200ms).  For  219  single-unit  sessions,  the  reward  magnitude  at  stake  (0.1  -  2.4ml)  on  each  trial 

4  was  cued  by  the  position  of  a  horizontal  bisecting  line  (200ms),  indicating  the  percentage  of  the 

5  maximum  possible  volume.  There  were  two  kinds  of  trials,  tenned  choice  trials  and  cued  trials. 

6  Following  a  variable  delay  (300,  500,  700ms),  choice  and  cued  trials  were  presented  at  equal 

7  probabilities,  randomly  interleaved.  On  choice  trials,  two  visual  targets  (4°  x  4°)  appeared  at  two 

8  random  locations  7°  eccentric  in  the  opposite  hemifield.  Actors  shifted  gaze  to  one  target  (±2.5°) 

9  to  indicate  a  choice  within  the  maximum  allowed  time  of  1.5s  (from  stimulus  onset).  The  pair  of 

10  stimuli  appearing  on  a  given  trial  was  drawn  from  the  set  of  three  stimuli  (Fig.  lb), 

1 1  pseudorandomly  selected.  On  cued  trials,  actors  maintained  fixation  (±2.5°)  while  a  cue  (4°  x  4°) 

12  appeared  centrally  (500ms).  Cues  indicating  rewards  for  the  actor,  recipient  or  neither  monkey 

13  occurred  with  equal  frequency,  pseudorandomly  determined  (Fig.  lb).  Reward  onset  was 

14  followed  by  a  0-900ms  delay,  from  the  time  of  either  making  a  choice  or  cue  offset.  Actors  were 

15  free  to  look  around  during  this  delay  and  for  one  second  after  reward  delivery.  Reward  delivery 

16  was  followed  by  an  intertrial  interval  of  700,  1,000,  or  1,300ms.  Upon  making  an  error  (see 

17  below),  both  monkeys  received  visual  feedback  (a  white  rectangle,  10°  x  10°)  followed  by  a  5s 

18  time  out  before  the  next  trial. 

19 

20  Recording  procedures 

21  All  recordings  were  made  using  tungsten  electrodes  (FHC).  Single  electrodes  were 

22  lowered  using  a  hydraulic  microdrive  system  (Kopf  Instruments,  or  FHC).  Single-unit 
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1  waveforms  were  isolated,  and  action  potentials  collected,  using  a  16-channel  recording  system 

2  (Plexon). 

3  In  order  to  guide  the  placement  of  recording  tracks  and  localize  recording  sites,  we 

4  acquired  structural  magnetic  resonance  images  (MRI)  (3T,  1  mm  slices)  of  each  actor’s  brain. 

5  Detailed  localizations  were  made  using  Osirix-viewer.  In  addition  to  MRI-guidance,  we 

6  confirmed  that  electrodes  were  in  ACCg,  ACCs,  or  OFC  by  listening  to  grey-matter  and  white- 

7  matter  associated  sounds  while  lowering  the  electrodes.  ACCg  neurons  were  recorded  from 

8  Brodmann  areas  24a,  24b  and  32;  ACCs  neurons  (dorsal  and  ventral  banks)  were  recorded  from 

9  24c  and  24c’;  OFC  neurons  were  recorded  from  13m  and  11  (based  on  standard  anatomical 

10  references51,52)  (Fig.  3a  and  Fig.  6). 

1 1  Single-unit  recordings  were  made  from  two  actor  monkeys  while  each  was  engaged  in  a 

12  reward-allocation  task  with  a  recipient  monkey  in  267  sessions.  A  total  of  81  ACCg  neurons 

13  (MY:  45,  MO:  36),  101  ACCs  neurons  (MY:  39,  MO:  62),  and  85  OFC  neurons  (MY:  46,  MO: 

14  39)  were  included  in  the  study.  Neurons  were  selected  for  recording  based  solely  on  the  quality 

15  of  isolation.  For  a  small  subset  of  the  data  (18%  of  the  total)  (ACCg:  0%;  ACCs:  25%;  OFC: 

16  27%),  data  were  collected  in  a  task  with  a  fixed  reward  size  (typically  1.0ml  per  successful  trial) 

17  (identical  to  Fig.  Id  except  without  the  magnitude  cue).  For  the  majority  of  the  cells  (82%  [n  = 

18  219]  of  the  total),  data  were  either  collected  in  a  task  with  the  magnitude  cue  (Fig.  Id)  (ACCg: 

19  100%  [n  =  81];  ACCs:  60%  [/;  =  61];  OFC:  42%  [n  =  36]),  or  both  with  and  without  the 

20  magnitude  cue  (i.e.,  two  or  more  consecutive  blocks  per  cell)  (ACCg:  0%;  ACCs:  15%  [n  =  15]; 

21  OFC:  31%  [//  =  26]).  We  combined  the  two  types  of  data  in  our  analyses  unless  otherwise 

22  specified. 
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Data  from  each  cell  consisted  of  firing  rates  during  440  ±  13  (±217)  (median  ±  s.e.m. 
(±s.d.))  trials.  A  trial  was  considered  “incomplete”  if  the  monkey  failed  to  choose  a  target  on 
choice  trials  (choice-avoidance  error)  or  to  maintain  fixation  after  cue  onset  on  cued  trials 
(forced-choice  avoidance  error).  Such  trials  were  not  included  in  the  neural  analysis.  The 
monkeys  perfonned  the  task  well,  as  evidenced  by  a  high  percentage  of  correct  trials  even  on 
trials  in  which  they  did  not  receive  juice  reinforcement  (Fig.  2a). 


Data  analysis 


Choice  preference  indices  were  constructed  as  contrast  ratios  (Eq.  I)33’34 


Preference  Index  = 


Ra  ~  Rb 
Ra  +  Rb 


(1) 


Ra  and  Rb  were  the  frequency  of  making  particular  choices.  For  Self:Other  trials,  Ra  and  Rb  were 
number  of  choices  to  reward  other  and  self,  respectively.  For  Other.Neither  trials,  Ra  and  Rb 
were  number  of  choices  to  reward  other  and  neither,  respectively.  Finally,  for  SelfiNeither  trials, 
Ra  and  Rb  were  number  of  choices  to  reward  neither  and  self,  respectively.  Indices  therefore 
ranged  from  -1  to  1,  with  1  corresponding  to  always  choosing  to  allocate  reward  to  other  on 
Other.Neither  trials  and  SelfOther  trials,  and  always  choosing  not  to  reward  self  on  SelfNeither 
trials.  An  index  of-1  corresponds  to  the  opposite,  generally  stated  as  choosing  not  to  allocate 
reward  to  the  other  monkey  or  choosing  to  reward  oneself.  Values  of  0  indicated  indifference. 
For  constructing  neuronal  preferences,  we  simply  substituted  the  choice  frequency  with  neuronal 
firing  rates  associated  with  making  specific  decisions.  Response  times,  the  time  from  the  onset  of 
choices  to  movement  onset,  were  computed  using  a  20°/sec  velocity  threshold  criterion33'34. 

Spike  rates  were  computed  during  the  reward  epoch  (from  50  to  600ms  from  reward 
onset)  as  well  as  the  choice/cue  epoch  (from  -100  to  300ms  from  making  a  choice  or  cue  offset). 
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1  For  the  population  analyses,  we  normalized  reward  firing  rates  to  the  average  baseline  rates  for 

2  each  reward  outcome  (300ms  interval  prior  to  fixation  onset).  Using  marginally  different  time 

3  windows  and  different  normalization  methods  all  resulted  in  similar  conclusions.  Coefficients  of 

4  variation  (CV)  were  calculated  for  each  neuron  based  on  the  standard  deviation  (a)  and  mean  (p) 

5  using  the  spike  rates  (sp/s)  from  the  reward  epoch  (Eq.  2): 

CV=-  (2) 

6  A4 

7  In  OFC  and  ACCs  populations,  the  two  self  rewards  (i.e.,  self  rewards  chosen  from  SelfNeither 

8  and  Self:  Other  trials)  were  largely  indifferent  (see  Fig.  4,  5b,  5c  and  Results)  and  thus  we 

9  combined  them  by  taking  means  for  the  CV  analysis.  In  contrast,  the  population  of  ACCg 

10  neurons  responded  more  strongly  to  self  rewards  obtained  from  a  social  context  {Self: Other) 

1 1  compared  to  when  there  was  no  reward  stake  for  the  other  monkey  {SelfNeither)  and  thus  we 

12  consider  the  two  self  rewards  separately  in  ACCg  (see  Fig.  3,  5a  and  Results). 

13  Analysis  of  variance  (ANOVA)  was  used  to  classify  the  reward  response  selectivity  of 

14  individual  neurons  from  each  area  and  perfonned  per  individual  cells.  Two-factor  ANOVA  was 

15  used  to  classify  the  selectivity  of  reward  outcome  {self  other,  or  neither)  and  trial  type  (choice  or 

16  cued)  for  all  neurons.  Three-factor  ANOVA  was  used  to  classify  the  selectivity  of  reward 

17  volume  (binned  into  small,  medium,  large)  for  the  82%  of  cells  from  all  areas  that  were  collected 

18  in  the  task  with  a  magnitude  cue.  Statistical  significance  for  each  reward  type  was  computed  by 

19  Tukey  HSD  test.  Finally,  we  excluded  three  OFC  cells  when  our  analyses  involved  using  the 

20  data  from  neither  rewards  because  these  cells  were  recorded  on  very  rare  sessions  in  which  the 

21  monkeys  either  never  chose  the  neither  reward  option  or  did  so  fewer  than  four  times.  Across  all 

22  analyses,  using  slightly  different  epoch  durations  for  neuronal  data  analyses  led  to  similar  results. 

23 
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1  Classification  of  cell  types  by  significant  reward  specificity 

2  Based  on  Tukey  HSD  tests  from  the  one-way  ANOVA  on  reward  outcome  ( self  other,  or 

3  neither )  for  both  the  choice/cue  epoch  and  reward  epoch  responses,  we  classified  cells  into  the 

4  following  categories:  self-referenced,  other-referenced,  both-referenced,  and  unclassified.  These 

5  categories  do  not  imply  functional  roles  but  indicate  that  firing  rates  were  significantly  different 

6  based  on  reward  outcomes.  We  refer  to  a  neuron  as  self-referenced  if  the  responses  of  the  neuron 

7  were  significantly  different  ( P  <  0.05)  between  self  and  other  rewards  as  well  as  between  self  and 

8  neither  rewards,  but  not  different  between  other  and  neither  rewards.  We  refer  to  a  neuron  as 

9  other-referenced  if  the  responses  of  the  neuron  showed  significant  differences  in  firing  rates 

10  between  self  and  other  rewards  as  well  as  between  other  and  neither  rewards,  but  not  different 

11  between  self  and  neither  rewards.  Finally,  we  refer  to  a  neuron  as  both-referenced  if  the 

12  responses  of  the  neuron  showed  significant  differences  in  responses  between  self  and  neither 

13  rewards  as  well  as  other  and  neither  rewards,  but  not  different  between  self  and  other  rewards. 

14  Neurons  that  did  not  fall  into  one  of  these  categories  were  considered  as  unclassified.  Applying 

15  slightly  different  criteria  or  differently  configured  ANOVA  did  not  change  the  overall 

16  proportional  trends  of  these  classes. 

17 

1 8  Reward  magnitude  analysis 

19  We  examined  reward  magnitude  modulation  in  219  neurons  (i.e.,  82%  of  all  neurons 

20  collected  with  the  magnitude  cue;  81  ACCg,  76  ACCs,  and  62  OFC  neurons).  We  performed  a 

21  linear  regression  on  the  activity  (sp/s)  of  individual  neurons  across  unbinned  reward  sizes.  We  fit 

22  the  data  using  the  reward  epoch  activity  separately  for  self  other,  and  neither  reward  outcomes 

23  and  obtained  fitted  slopes  (i.e.,  reward  magnitude  sensitivity  in  sp/s/ml)  for  each  reward 
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1  outcome.  For  examining  the  relationship  between  the  reward  magnitude  sensitivity  across  actors’ 

2  received  and  foregone  reward  outcomes,  we  compared  the  average  signed  slopes  from  all 

3  received  rewards  (s elf  rewards  on  choice  and  cued  trials)  and  all  foregone  rewards  ( other  and 

4  neither  reward  on  choice  and  cued  trials)  in  individual  neurons. 
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Supplementary  Table  1 


Supplementary  Table  1 .  Classification  of  the  reward  type,  trial  type,  and  reward  size  selectivities 
at  the  level  of  individual  neurons  from  OFC,  ACCs,  and  ACCg,  based  on  analysis  of  variance. 


Area 

Proportion  of 

Proportion  of 

Proportion  of 

significant  neurons 

significant  neurons 

reference  frame  types 

by  factors 

between  different  rewards 

(reward  or  choice/cue  epoch) 

(reward  epoch) 

(reward  epoch) 

Reward  Outcome 

57%  (n=81) 

Self  vs.  Neither 

31%  (n=81) 

SELF  frame 

Trial  Type 

36%  (n=81) 

of  reference 

Reward  Volume 

12%  (n=81) 

Self  vs.  Other 

36%(n=81) 

Reward  Outcome  x 

30%  (n=81) 

Trial  Type 

Other  vs.  Neither 

25%  (n=81 ) 

OTHER  frame 

Reward  Volume  x 

7%  (n=81) 

of  refernece 

Reward  Outcome 

Self  (Self -.Other) 

12%  (n=81) 

Reward  Volume  x 

4%  (n=81) 

vs.  Self  (Self:Neither) 

Trial  Type 

BOTH  frame 

Reward  Outcome  x 

7%  (n=81) 

of  reference 

Trial  Type  x 

Reward  Volume 

Reward  Outcome 

72%  (n=101) 

Self  vs.  Neither 

57%(n=101) 

SELF  frame 

Trial  Type 

52%  (n=101) 

of  reference 

Reward  Volume 

25%  (n=  76) 

Self  vs.  Other 

53%  (n=101) 

Reward  Outcome  x 

36%  (n=101) 

Trial  Type 

Other  vs.  Neither 

O 

■Vp 

O' 

O 

CM 

OTHER  frame 

Reward  Volume  x 

16%  (n=76) 

of  refernece 

Reward  Outcome 

Self  (Self -.Other) 

5%  (n=101) 

Reward  Volume  x 

4%  (n=  76) 

vs.  Self  (Self:Neither) 

Trial  Type 

BOTH  frame 

Reward  Outcome  x 

8%  (n= 76) 

of  reference 

Trial  Type  x 

Reward  Volume 

Reward  Outcome 

57%  (n=85) 

Self  vs.  Neither 

37%  (n= 85) 

SELF  frame 

Trial  Type 

45%  (n=85) 

of  reference 

Reward  Volume 

24%  (n=62) 

Self  vs.  Other 

42%  (n= 85) 

Reward  Outcome  x 

37%  (n=85) 

Trial  Type 

Other  vs.  Neither 

14%  (n=85) 

OTHER  frame 

Reward  Volume  x 

10%  (n=62) 

of  refernece 

Reward  Outcome 

Self  (Self -.Other) 

13%  (n=85) 

Reward  Volume  x 

10%  (n=62) 

vs.  Self  (Self:Neither) 

Trial  Type 

BOTH  frame 

Reward  Outcome  x 

11%  (n=62) 

of  reference 

15%  (n=12/81) 

[38%]  (n=12/32) 

(MY:  48%;  MO:  18%) 

12%  (n=10/81) 

[31  %]  (n=10/32) 

(MY:  19%;  MO:  55%) 

12%  (n=1 0/81 ) 

[31  %]  (n=10/32) 

(MY:  33%;  MO:  27%) 


51%  (n=51/101) 

[72%]  (n=51/71) 

(MY:  82%;  MO:  61%) 

10%  (n=1 0/1 01 ) 

[1 4%]  (n=10/71) 

(MY:  9%;  MO:  20%) 

10%  (n=1 0/1 01 ) 

[1 4%]  (n=10/71) 

(MY:  9%;  MO:  19%) 


OFC 


Trial  Type  x 
Reward  Volume 


42%  (n=36/85) 

[80%]  (n= 36/45) 

(MY:  86%;  MO:  70%) 

5%  (n=4/85) 

[9%]  (n=4/45) 

(MY:  7%;  MO:  12%) 

6%  (n=5/85) 

[11  %]  (n=5/45) 

(MY:  7%;  MO:  18%) 


The  bold  percentages  shown  inside  the  brackets  on  the  4th  column  show  the  proportions  out  of  classfied  neurons.  Shown  below  in 
parentheses  are  these  proportions  for  each  monkey.  Significance  in  all  panels  was  based  on  P<  0.05  (analysis  of  variance  and 
tukey  HSD  tests). 
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Supplementary  Figure  &  Legends 
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Supplementary  Figure  1 


Supplementary  Figure  1  Percentage  of  gaze  shifts  (mean  ±  s.e.m.)  directed  toward  the 
recipient  prior  to  reward  delivery  (pre-reward  epoch;  grey)  and  following  the  onset  of  reward 
delivery  (post-reward  epoch;  black)  on  choice  trials  (a)  and  cued  trials  (b).  Asterisks  indicate 
significant  differences  ( P  <  0.05,  Welch  two  sample  /-test).  Choice  trials.  During  the  delay 
period  between  choice  and  reward  delivery  (pre -reward  epoch,  0-0.9  sec;  see  Fig.  Id),  gaze 
frequencies  were  comparable  across  trials  involving  actors’  received  and  allocated  rewards  to 
other  (44.75  ±  1.78%  [mean  ±  s.e.m.],  44.57  ±  1.78%,  42.74  ±  0.83%  on  Self:Neither, 
Self: Other,  Other:Neither  trials,  respectively;  all  comparisons  P  >  0. 1 7,  paired  /-test).  Following 
the  onset  of  reward  delivery  (post-reward  epoch,  1  sec),  however,  these  frequencies  were 
significantly  higher  on  Other:Neither  trials  (67.01  ±  1.01%)  compared  to  SelfOther  (53.78  ± 
1.88%)  and  SelfNeither  trials  (53.95  ±  1.88%)  (both,  P  <0.0001,  paired  t- test).  Cued  trials. 
During  the  delay  period  between  cue  offset  and  reward  delivery  (pre-reward  epoch),  gaze 
frequencies  were  comparable  across  cued  self,  cued  other  and  cued  neither  trials  (42.19  ±  1.90, 
41.72  ±  0.92,  and  40.03  ±  0.92,  respectively;  all  comparisons  P  >  0.20).  Following  the  onset  of 
reward  delivery  (post-reward  epoch),  however,  gaze  frequencies  were  the  highest  for  cued  other 
trials  (66.70  ±  1.15),  compared  to  cued  self  { 53.19  ±  1.93)  or  cued  neither  trials  (63.26  ±  1.05) 
(both  P  <  0.05).  Therefore,  actors  looked  at  the  recipient  at  different  rates  depending  on  reward 
outcomes,  as  reported  previously33,34. 
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Supplementary  Figure  2 


Supplementary  Figure  2  Choice  preferences  for  each  actor  monkey  (MY  and  MO).  Shown  are 
the  preference  indices  as  a  function  of  reward  outcome  contrasts  (i.e.,  choice  contexts)  for  actor 
MY  (left  panel)  (130  single -unit  sessions)  and  for  actor  MO  (right  panel)  (137  single-unit 
sessions).  Data  points  next  to  each  bar  show  individual  sessions.  The  preference  index  for  actor 
MY  was  -1.00  ±  0.00  (mean  ±  s.e.m.)  for  Self:Neither,  -1.00  ±  0.00  for  Self: Other,  and  0.28  ± 
0.02  for  Other :Neither  trials  (significantly  different  from  zero:  all  P  <  0.0001,  one-sample  t- test). 
For  actor  MO,  the  preference  index  for  SelfNeither  was  -0.98  ±  0.00,  SelfOther  was  -0.97  ± 
0.00,  and  Other :Neither  was  0.07  ±  0.01  (significantly  different  from  zero:  all  P  <  0.0001,  one- 
sample  /-test).  These  choice  behaviors  are  consistent  with  our  previous  studies  using  a  similar 
behavioral  paradigm,  which  also  demonstrated  differential  reward  allocation  preferences 
depending  on  the  familiarity  and  social  status  between  the  two  animals33  and  the  causal  role  of 
neuropeptide  oxytocin  in  modulating  these  preferences34. 
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Supplementary  Figure  3 


Supplementary  Figure  3  Preferential  reward  encoding  biases  in  each  actor  monkey  (MY  and 
MO).  Shown  are  the  normalized  responses  (mean  ±  s.e.m.)  to  different  reward  outcomes  during 
the  reward  epoch  from  ACCg  (a),  ACCs  (b),  and  OFC  (c).  The  inset  shows  the  color  coding 
scheme. 
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Supplementary  Figure  4 


Supplementary  Figure  4  Comparisons  of  neuronal  responses  across  the  choice/cue  epoch  and 
the  reward  epoch  for  (a)  ACCg  ( n  =  81),  (b)  ACCs  ( n  =  101),  and  (c)  OFC  ( n  =  85).  Plotted  are 
the  normalized  responses  from  the  two  epochs  for  the  following  comparisons:  self  choices  versus 
self  rewards,  foregone  choices  versus  foregone  rewards,  and  other  choices  versus  other  rewards. 
Data  points  with  black  outlines  indicate  that  these  values  were  truncated  for  the  purpose  of  these 
displays.  Significance  values  are  shown  at  the  top  of  each  panel  comparing  the  ordinate  and 
abscissa  at  the  population  level  using  a  paired  t-test.  At  the  population  level,  we  observed  a 
remarkable  resemblance  in  neuronal  activity  across  the  choice/cue  epoch  and  reward  epoch  in 
ACCg,  ACCs,  and  OFC  (Fig.  3  &  4).  To  quantify  this  similarity,  we  directly  compared 
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normalized  activity  from  the  two  epochs  corresponding  to  the  following  pairs:  self  choices 
(choices  leading  to  self  rewards)  versus  self  rewards  (delivery  of  ,sx7/'rcwards),  foregone  choices 
(leading  to  other  or  neither  rewards)  versus  foregone  rewards  (delivery  of  other  or  neither 
rewards),  and,  finally,  other  choices  versus  other  rewards.  Although  the  majority  of  comparisons 
resulted  in  similar  responses  across  the  two  epochs,  there  were  some  differences.  The  following 
summarizes  the  population  level  effects  that  we  have  observed  here.  ACCg  as  a  population 
showed  greater  activity  for  .se//  rewards  during  the  time  of  reward  delivery  compared  to  the  time 
around  making  choices  leading  to  self  rewards  (P  =  0.05,  paired  t-test).  On  the  other  hand, 
responses  of  ACCs  neurons  were  similar  in  magnitude  across  all  three  comparisons  (all  P  >  0.15, 
paired  t-test).  In  contrast,  OFC  neurons  showed  a  trend  toward  greater  responses  to  foregone 
choices  compared  to  foregone  rewards  (P  <  0.07,  paired  t-test),  as  well  as  greater  responses  to 
other  choices  compared  to  other  rewards  (P  <  0.08).  At  the  individual  cell  level,  however,  a 
substantial  number  of  neurons  from  each  area  showed  significantly  modulated  activity  across  the 
two  epochs.  In  ACCg,  38%  (n  =  31),  24%  (n  =  19),  and  21%  (n  =  17)  of  neurons  showed 
significantly  modulated  activity  across  the  two  epochs  for  self  foregone,  and  other  choices 
versus  rewards,  respectively.  In  ACCs,  these  proportions  were  63%  (n  =  64),  39%  (n  =  39),  and 
30%  (n  =  30),  whereas  in  OFC,  these  proportions  were  60%  (n  =  5 1),  40%  (n  =  34),  and  28%  (n 
=  24). 
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Supplementary  Figure  5  Proportion  of  neurons  (out  of  significantly  classified  neurons)  after 
correcting  for  eye  movement  choice  reaction  times  (i.e.,  by  using  reaction  times  as  covariates  in 
the  general  linear  model)  from  OFC,  ACCs,  and  ACCg  using  self-referenced,  other-referenced, 
and  both-referenced  frames  for  representing  reward  outcomes.  Inset  shows  color  codes  used  in 
the  bar  graph.  Bars  indicate  significant  differences  in  proportions  ( P  <  0.05,  %  test).  The 
proportions  of  reference  frame  types  across  the  three  areas  remain  similar  even  after  correcting 
for  trial-by-trial  reaction  times  (compare  to  Fig.  5d).  Figure  2b  clearly  indicates  that  different 
choices  are  associated  with  different  choice  reaction  times.  Therefore,  it  is  possible  that 
differential  encoding  schemes  reported  here  might  be  simply  driven  by  the  subjective  value  of 
different  choices  (as  inferred  from  reaction  times).  For  example,  if  neurons  were  merely 
computing  the  subjective  value  associated  with  different  choices,  one  might  expect  choice 
reaction  times  to  explain  a  large  amount  of  variance  in  neuronal  response.  We  directly  tested  this 
hypothesis  by  including  trial-by-trial  reaction  times  as  covariates  in  the  ANOVA  and  re¬ 
calculated  the  proportion  of  neurons  classified  within  different  functional  categories.  This  figure 
shows  the  distribution  of  different  reference  frame  types  across  each  area  after  taking  into 
account  choice  reaction  times.  The  results  are  virtually  identical  to  those  shown  in  Fig.  5d, 
suggesting  that  self-referenced,  other-referenced,  and  both-referenced  neurons  are  not  the 
products  of  encoding  the  subjective  value  (as  revealed  by  reaction  times)  associated  with 
different  choices.  Therefore,  the  neurons  in  the  current  study  appear  to  signal  specific  decision 
outcomes  during  social  decision-making,  rather  than  directly  encoding  their  subjective  value. 
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Supplementary  Figure  6  Effects  of  value  on  self,  other,  and  neither  reward  responses,  (a-c) 
Reward  magnitude  sensitivity  for  ACCg,  ACCs,  and  OFC  (unsigned  slopes,  sp/s/ml,  mean  ± 
s.e.rn.)  computed  from  linear  regression  of  reward  epoch  activity  as  a  function  of  increasing 
value  for  self  (a),  other  (b),  or  neither  (c)  outcomes.  Horizontal  bars  above  the  histograms 
indicate  significance  (solid:  P  <  0.05,  dashed:  P  <  0.10,  Welch  two  sample  f-test).  (d)  Value 
sensitivity  for  actors’  received  and  foregone  rewards  are  positively  correlated  in  ACCs.  Slopes  of 
linear  regressions  of  reward  epoch  activity  as  a  function  of  reward  volume  for  actors’  received 
rewards  {Self: Neither,  Self:  Other,  and  cued  self)  and  foregone  rewards  {other,  neither  rewards, 
other  cued,  and  neither  cued).  Red  points  indicate  significant  effects  of  reward  value  or 
interactions  (ANOVA).  Line  indicates  type  II  regression  for  neurons  with  significant  reward 
value  or  interaction  effects,  r  and  p  reflect  Pearson’s  correlations  for  significant  cells.  Here  we 
examined  response  modulations  by  the  magnitude  of  reward  delivered  to  self,  other,  and  neither. 
Based  on  the  ANOVA  on  reward  epoch  responses,  40%  of  OFC  (out  of  62  cells  collected  in  a 
task  with  a  reward  magnitude  cue),  40%  of  ACCs  (of  76),  and  21%  of  ACCg  (of  81)  showed 
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either  a  significant  effect  of  reward  magnitude  or  a  significant  interaction  involving  reward 
magnitude  (Supplementary  Table  1).  ACCg  contained  a  significantly  smaller  proportion  of 
neurons  modulated  by  reward  magnitude  compared  to  either  OFC  or  ACCs  (both  P  <  0.05, 
test),  whereas  OFC  and  ACCs  did  not  differ  (P  =  1).  Out  of  these  regions,  ACCs  neurons  showed 
the  greatest  sensitivity  to  reward  magnitude  based  on  the  slopes  of  the  regression  line  for  each 
neuron  across  all  outcomes,  consistent  with  a  prominent  role  for  ACCs  in  behavioral 
adjustment3 5-3 7  in  an  environment  with  constantly  changing  reward  types  and  contexts.  We  next 
explored  in  detail  how  the  magnitude  of  foregone  rewards  and  self  rewards  was  encoded  in  each 
area.  We  found  a  significant  positive  relationship  between  actors’  received  and  forgone  rewards 
in  the  sample  of  ACCs  neurons  showing  significant  effects  of  reward  magnitude  (significant 
cells:  r  =  0.50,  P  <  0.005;  all  cells:  r  =  0.33,  P  <  0.005,  Pearson’s  correlation)  (d).  By  contrast, 
we  did  not  observe  this  relationship  in  the  ACCg  or  OFC  neurons  with  significant  reward 
magnitude  effects:  both  regions  \r\  <  0.28,  P  >  0.18;  all  cells:  both  r  <  0.21,  P  >  0.11;  Pearson’s 
correlation)  (see  Supplementary  Figure  7).  Thus,  the  ACCs,  but  not  the  ACCg  or  OFC, 
processes  actors’  direct  and  forgone  rewards  in  a  similar  manner  (i.e.,  scale  in  the  same 
direction),  consistent  with  its  hypothesized  role  in  learning  from  both  experience  and 
observation8'11. 
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Supplementary  Figure  7 


Supplementary  Figure  7  Reward  magnitude  sensitivity  between  actors’  received  and  foregone 
rewards  in  ACCg  (a)  and  OFC  (b).  Plotted  are  slopes  from  a  linear  regression  of  reward  epoch 
activity  as  a  function  of  different  reward  volumes  between  actors’  received  rewards 
(Self: Neither,  Self:  Other,  and  cued  self)  and  foregone  rewards  (other,  neither  rewards,  other 
cued,  and  neither  cued).  Red  data  points  indicate  significant  reward  magnitude  main  or 
interactions  effects  (ANOVA).  Shown  as  texts  in  the  inset  are  r  and  significance  from  Pearson’s 
correlation  for  the  significant  neurons.  Data  points  with  black  outline  show  the  outlier  cells 
excluded  from  the  correlation  analysis. 
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Supplementary  Figure  8 


Supplementary  Figure  8  Differences  in  coefficient  of  variation  (CV)  across  different  reward 
outcomes  reflect  reward  bias  in  ACCg  (a),  ACCs  (b)  and  OFC  (c).  Plotted  are  differences  in  CV 
between  a  pair  of  reward  categories  (as  indicated  on  the  right  of  each  distribution).  We  compared 
individual  neuron  averages  of  all  trials  in  which  the  actors  received  rewards  against  all  trials  in 
which  the  actors  did  not  receive  rewards  ( Received  -  Foregone)  (top  of  each  panel).  We  also 
compared  individual  neuron  averages  of  trials  in  which  the  recipient  received  the  rewards  against 
trials  in  which  no  one  received  rewards  ( Other  -  Neither )  (middle  of  each  panel),  and  between 
trials  in  which  the  actors  received  rewards  in  Self : Neither  against  Self: Other  contexts  (Self 
(SelfNeither)  -  Self  (Self:  Other))  (bottom  of  each  panel).  If  applicable,  the  data  were  collapsed 
across  choice  and  cued  trials  for  this  analysis.  Data  points  are  jittered  in  the  vertical  dimensions 
for  visibility.  Asterisks  above  the  data  points  indicate  significance  (**:P  <  0.05,  *:  P  <  0.10,  one 
sample  /-test)  in  the  distribution.  An  alternative  way  to  examine  neuronal  information  encoding 
is  to  assess  whether  lower  trial-to-trial  variability  is  associated  with  preferred  outcomes.  We 
tested  whether  the  coefficient  of  variation  in  firing  rates  (CV;  Online  Methods  Eq.  2)  was 
systematically  lower  for  preferred  reward  outcomes  (based  on  response  magnitude)  compared  to 
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non-preferred  reward  outcomes.  We  found  this  to  be  the  case.  The  OFC  population  showed  a 
lower  CV  for  self  rewards  vs.  rewards  delivered  to  other  or  neither  {Received  -  Foregone,  -0.12 
±  0.04  [mean  ±  s.e.m.],  P  <  0.01,  one-sample  /-test),  whereas  the  ACCs  population  showed  a 
lower  CV  for  rewards  delivered  to  other  or  neither  {Foregone)  (0.07  ±  0.03,  P  <  0.05,  one 
sample  /-test).  In  ACCg,  where  some  neurons  preferred  self  and  some  preferred  other  rewards, 
we  found  a  lower  CV  only  for  actors’  received  rewards  compared  to  foregone  rewards  (-0.07  ± 
0.04,  P  <  0.09,  one  sample  /-test,  P  <  0.05,  bootstrap  test),  but  no  difference  between  other  and 
neither  rewards,  or  between  the  two  contexts  of  receiving  self  rewards  (all  P  >  0.34).  Thus,  the 
most  robust  responses  of  neurons  in  all  three  areas  were  also  the  most  reliable. 
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Supplementary  Figure  9 


Supplementary  Figure  9  Reward  coding  is  not  driven  by  gaze  shifts  directed  at  the  recipient. 
Shown  are  histograms  of  the  differences  in  normalized  reward  epoch  responses  between  trials 
with  gaze  shifts  and  without  gaze  shifts  (responses  ‘with’  -  responses  ‘without’  gaze  shifts),  for 
trials  in  which  rewards  were  delivered  to  self  (top),  other  (middle),  or  neither  (bottom),  for 
ACCg  (a),  ACCs  (b),  and  OFC  (c)  populations.  Arrows  indicate  distribution  means.  In  the 
reward- allocation  task,  actors  were  allowed  to  look  at  the  recipient  (Fig.  Id  &  Supplementary 
Figure  1).  To  rule  out  the  possibility  that  preferential  reward  responses  of  neurons  in  these  areas 
were  simply  driven  by  where  the  actors  looked  on  a  given  trial,  we  compared  reward  epoch 
responses  between  trials  with  and  without  gaze  shifts  to  the  recipient.  We  found  no  systematic 
differences  in  these  reward  responses  at  the  population  level  (each  areas  and  each  reward 
outcome:  all  P  >  0.20,  Wilcoxon  signed  rank  test).  The  reward-related  responses  in  the  three 
regions  are  thus  neither  simply  driven  by  preparation  to  look  at  the  recipient  nor  elicited  as  a 
consequence  of  inspecting  the  recipient. 
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Abstract 


In  this  issue,  Schilbach  et  al.  defend  a  "second-person  neuroscience"  perspective  that  focuses  on  the 
neural  basis  of  social  cognition  during  live,  on-going  interactions  between  individuals.  We  argue  that  a 
second-person  neuroscience  would  benefit  from  formal  approaches  borrowed  from  economics  and 
behavioural  ecology  and  that  it  should  be  extended  to  social  interactions  in  non-human  animals. 

Main  text 


The  "second-person  neuroscience"  proposed  by  Schilbach  et  al.  in  the  current  issue  of  Behavioral  and 
Brain  Sciences  proffers  the  intriguing  idea  that  social  cognition  during  real-time  interactions  with 
another  individual  may  be  fundamentally  different  from  passive  observations  of  another's  actions. 
Understanding  the  contribution  of  neural  processes  to  on-going  interactions  with  complex  beings  is  a 
fascinating  research  direction,  with  potential  implications  for  the  treatment  of  disorders  attended  by 
social  deficits  as  well  as  for  ethics  and  public  policy. 

Several  decades  of  neuroscientific  research  have  sketched  out  the  neural  circuits  that  may  translate 
perceptual  information  about  other  individuals  into  purposeful  action.  Specifically,  regions  of  the  human 
and  nonhuman  primate  brain  including  the  superior  temporal  sulcus  and  fusiform  face  area  contribute 
to  social  identification  (Tsao  et  al.,  2008).  The  ventromedial  prefrontal  cortex,  orbitofrontal  cortex  and 
striatum  appear  to  play  a  role  in  translating  knowledge  of  others  into  motivational  signals  (Burke  et  al., 
2010;  Cooper  et  al.,  2010;  Azzi  et  al.,  2012).  The  anterior  cingulate  cortex  and  fronto-insular  cortex 
contribute  to  empathy  and  other-regarding  cognition  (Decety,  2010;  Gu  et  al.,  2010).  The  so-called 
mentalizing  and  mirroring  networks  appear  to  participate  in  action  and  intention  understanding 
(Rizzolatti  and  Sinigaglia,  2010;  Becchio  et  al.,  2012).  Circuits  connecting  these  areas  could  translate 


social  perceptual  information  into  appropriate  actions  via  decision-making  mechanisms  (Baumgartner  et 
al.,  2009;  Knoch  et  al.,  2009;  Baumgartner  et  al.,  2011). 

To  better  understand  the  neural  mechanisms  underlying  social  cognition,  we  propose  that  social 
neuroscience  needs  to  ground  its  predictions  and  hypotheses  in  a  formal  framework  such  as  that 
provided  by  behavioural  game  theory  (Platt  and  Glimcher,  1999;  Dorris  and  Glimcher,  2004;  Kosfeld  et 
al.,  2005;  Tomlin  et  al.,  2006;  Lee,  2008;  Gintis,  2009).  Schilbach  et  al.  criticize  game  theoretical 
approaches  for  not  recreating  the  dynamics  of  everyday  real-life  social  encounters,  but  this  common 
opposition  has  been  rebutted  before  (Gintis,  2009).  Game  theoretical  frameworks  are  general  and  open, 
allowing  formal  delineation  of  specific  hypotheses  while  not  imposing  restrictions  on  the  behaviours 
that  are  being  described.  Formal  approaches  borrowed  from  economics,  game  theory  and  behavioural 
ecology  have  been  extremely  useful  in  describing  decisions  in  dynamic  foraging  or  social  environments 
(Sugrue  et  al.,  2004;  Lee,  2008;  Hayden  et  al.,  2011;  Chang  et  al.,  2011). 

These  approaches  can  be  extended  to  describe  the  dynamics  of  interacting  individuals,  with  several 
advantages.  First,  they  allow  us  to  generate  empirically-testable  and  mathematically-formalizable 
predictions  about  the  neural  mechanisms  that  could  underlie  decisions  in  complex  social  environments. 
Second,  they  allow  for  comparative  analyses  of  decision  processes  in  humans  and  other  animals  with 
respect  to  the  demands  placed  on  them  in  specific  physical  and  social  environments  (Kacelnik  and 
Bateson,  1996;  Stephens  et  al.,  2002;  Heilbronner  et  al.,  2008). 

Schilbach  et  al.  also  raise  the  concern  that  classical  game  theory  paradigms  involve  mainly  one-shot 
interactions  or  turn-taking.  Although  this  structure  is  often  used  for  simplicity,  we  contend  that 
continuous  interactions  in  interactive  games  can  also  be  effectively  described  using  a  similar  theoretical 
framework  (Debreu,  1952;  Braun  et  al.,  2009).  Such  mathematical  tools  would  help  translate  some  of 
the  intuitive  aspects  of  Schilbach  et  al.'s  approach  into  concrete  experimental  predictions. 

Second-person  neuroscience  would  also  benefit  from  broadening  its  inquiry  to  the  interactions  of 
nonhuman  animals  (Washburn  et  al.,  1990;  Fujii  et  al.,  2007;  Chang  et  al.,  2011).  Social  complexity 
appears  to  have  favored  the  evolution  of  higher  social  cognition  in  animals  that  have  brains  similar  to 
ours,  like  macaques  (Barsalou,  2005;  Rudebeck  et  al.,  2006;  Tsao  et  al.,  2008;  Azzi  et  al.,  2012;  Chang  et 
al.,  2012)  and  in  animals  that  have  very  different  brains  as  well,  like  scrub  jays  and  rooks  (Emery  and 
Clayton,  2001;  Bird  and  Emery,  2010).  We  know  from  research  in  macaques,  sheep  and  mice  that  social 
cognition  in  mammals  appears  to  rely  on  neural  circuits  that  are  similar,  and  perhaps  homologous,  to 
those  in  humans  (Barsalou,  2005;  Rudebeck  et  al.,  2006;  Tsao  et  al.,  2008;  Sanchez-Andrade  and 
Kendrick,  2009;  Jeon  et  al.,  2010;  Azzi  et  al.,  2012).  One  possible  explanation  is  that  we  inherited  those 
circuits  from  a  common  ancestor  that  possessed  some  level  of  social  complexity.  Alternatively,  similar 
constraints  applying  to  neural  circuits  could  also  have  caused  them  to  evolve  in  similar  ways  to  support 
similar  functions.  How  such  functions  are  accomplished  by  neural  circuits  in  animals  with  brains  that  are 
very  different  from  our  own  —  such  as  birds  —  remains  an  open  question. 

We  agree  with  Schilbach  et  al.  that  studying  the  neural  processes  mediating  live  interaction  between 
real  agents  is  crucial  for  the  maturation  of  social  neuroscience  as  a  discipline.  What  we  propose  is  to 


supplement  this  approach  with  formal  game  theory  and  value-based  analysis  of  preferences  in  humans 
and  nonhuman  animals.  In  our  lab,  for  example,  we  study  pairs  of  monkeys  interacting  both  in 
economical  and  interactive  games  (for  instance,  see  Chang  et  al.,  2011).  Estimating  preferences  allows 
us  to  quantify  how  much  monkeys  value  certain  options  (for  instance,  giving  juice  to  another  monkey). 
Game  theory  will  allow  us  to  generate  predictions  of  the  equilibriums  that  could  develop  over  time 
between  two  interacting  individuals  (see  Braun  et  al.,  2009).  Understanding  the  neural  processes  that 
underlie  social  cognition  in  such  animals  could  powerfully  inform  our  understanding  of  the  evolutionary 
origins  of  our  own  social  abilities. 
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